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of pathogenic mycobacteria] disease or infection. This 
invention also provides for methods of screening for 
compounds capable of modulating the activity, production 
or secretion of a mycobacterium secreted acid phosphatase. 
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Secreted Acid Phosphatase {sapM) is Present Only in Pathogenic 
IMycobacteria and Expressed Selectively at Phagosomai pH 



Field of invention 

This invention relates to diagnostic methods and vaccines for pathogenic 
mycobacterial or pathogenic fungal disease or infection. 



Background 

Pathogenic mycobacteria including Mycobacterium tuberculosis fes\6e within 
the phagosome of the host macrophage, a harsh environment that is 
detrimental to most microbes. It is evident that M. tuberculosis modifies this 
phagosomai compartment in order to enhance its own intracellular survival 
(Clemens and Horwitz, 1995). This includes alterations in Rab GTPase 
composition (Via etal., 1997; Clemens eta!,, 2000) and exclusion of the 
vacuolar proton ATPase (Sturgill ef a/., 1994). Consequently, the 
mycobacterial phagosome is arrested at an eariy stage of maturation and only 
mildly acidified to pH 6,1 - 6.5 (Xu et ai, 1994; Sturgill ef a/., 1994; Clemens 
and Honwitz, 1995). The ability of pathogenic mycobacteria to persist in 
macrophages plays a crucial role in disease establishment and progression. 
However, the specific bacterial factors that undermine the host processes 
remain largely unknown. Acting on the assumption that genes contributing to 
bacterial pathogenesis are differentially expressed during infection, various 
approaches have been used to identify M. tuberculosis genes whose 
expression are preferentially upregulated in vivo (i.e., within macrophages) 
(reviewed in (Triccas and Gicquei, 2000) or under in vitro conditions that 
mimic the host environment encountered by the bacillus (e.g., low oxygen 
tension, acidic pH, and nutrient starvation) (Betts ef a/., 2002; Fisher ef a/., 
2002). Although a number of candidate virulence genes have been described, 
many have no known biological function and their direct roles on intracellular 
survival of the bacillus remain obscure. The identification of mycobacterial 
proteins that contribute to the replication and survival of the bacteria is critical 
for understanding the pathogenesis and protective mechanisms of the 
disease. 
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Intracellular bacterial pathogens including M. tuberculosis and Salmonella 
enterica serovar Typhimurium encounter a wide variety of environmental 
conditions within the host that are not accessible to commensal species. The 
ability to respond to environmental cues, and adjust to harsh intracellular 
conditions allows these pathogens to survive. Environmental cues, including 
temperature, osmolarity, pH, oxygen tension, and nutrient availability, have 
been found to act as signals for the induction of specific subsets of bacterial 
genes, which contributes to enhanced survival or multiplication. Many of these 
genes have been identified in Salmonella, and some have been shown to play 
important roles in infection and/or pathogenesis (reviewed in (Lucas and Lee, 
2000). Using a variety of strategies, a number of candidate virulence genes of 
M. tuberculosis have also been identified (Triccas and Gicquel, 2000). Yet 
direct evidence supporting a necessary role in virulence is only available for a 
few genes, e.g., isocitrate lyase {aceA) (McKinney et aL, 2000), a-crystallin 
(acr) (Yuan et aL, 1998), and the exported repetitive protein {erp) (Berthet ef 
a/.. 1998). 

It is well known that pathogenic mycobacteria actively modify the phagosomal 
pH. Phagosomes containing viable pathogenic M. tuberculosis or M. avium 
are only mildly acidified (pH 6 - 6.5), which is, at least partially, due to the 
paucity of proton ATPase (Sturgill et ai, 1994). In contrast, nonpathogenic 
mycobacteria such as M. smegmatis and Mycobacterium gordonae, or dead 
pathogenic mycobacteria, do not block phagosomal acidification and the pH of 
phagosomes containing these bacteria is acidified to values similar to the pH 
of late endbsomes and lysosomes (pH «5) (Kuehnel etal., 2001). These data 
suggest that productions of factors found only in pathogenic strains are 
Important for inhibition of phagosomal acidification and intracellular survival. 
Acidification may be a signal to induce expression of genes needed to alter 
phagosomal maturation. SapM, the secreted acid Qhosphatase of M. 
tuberculosis H37Rv, was identified recently (Saleh and Belisle, 2000). 

Summary 

This invention provides an isolated DNA sequence comprising a promoter or 
promoter fragment from the 5' flanking region upstream of the start site of the 
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coding region of a mycobacterial secreted acid phospliatase gene [SEQ ID 
NO:9], [SEQ ID NO:11], [SEQ ID NO:13]. [SEQ ID NO:15] wherein the 
promoter or promoter fragment is sufficient to control expression of a 
nucleotide sequence of Interest and is inducible under low-pH conditions. 
This invention also provides an isolated DNA sequence comprising a 
promoter or promoter fragment from the 5' flanking region upstream of the 
start site of the coding region of a secreted acid phosphatase gene selected 
from the group consisting of Mycobacterium tuberculosis [SEQ ID NO:9], 
Mycobacterium bovis [SEQ ID NO:1 1], Mycobacterium avium [SEQ ID 
NO:13], or Mycobacterium marinum [SEQ ID NO:15]. 
The present invention relates to an isolated DNA sequence comprising a 
promoter or promoter fragment which hybridizes to [SEQ ID NO:1], [SEQ ID 
NO:2], [SEQ ID NO:3], [SEQ ID N0:4] under high stringency hybridization 
conditions and which is sufficient to control expression of a nucleotide 
sequence of interest and is inducible under low pH conditions. 
The present invention also relates to an expression vector comprising an 
isolated DNA sequence which comprises a promoter or promoter fragment 
from the 5' flanking region upstream of the start site of the coding region of a 
mycobacterial secreted acid phosphatase gene [SEQ ID NO: 9], [SEQ ID 
NO:11], [SEQ ID NO:13], [SEQ ID NO:1 5] wherein the promoter or promoter 
fragment is sufficient to control expression of a nucleotide sequence of 
interest and is inducible under low-pH conditions. 
The present invention also relates to an expression vector comprising ari. 
isolated DNA sequence which comprises a promoter or prorhoter fragment 
from the 5' flanking region upstream of the start site of the coding region of a 
secreted acid phosphatase gene selected from the group consisting of 
Mycobacterium tuberculosis [SEQ ID NO:9], Mycobacterium bovis [SEQ ID 
NO:1 1], Mycobacterium avium [SEQ ID NO:13], or Mycobacterium marinum 
[SEQ ID NO:15]. 

The present invention also relates to an expression vector comprising an 
isolated DNA sequence which comprises a promoter or promoter fragment 
which hybridizes to [SEQ ID NO:1], [SEQ ID NO:2]. [SEQ ID NO:3], [SEQ ID 
NO:4] under high stringency hybridization conditions and which is sufficient to 
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control expression of a nucleotide sequence of interest and is inducible under 
low-pH conditions. 

The invention further relates to a host cell transformed with an expression 
vector comprising an isolated DNA sequence which comprises a promoter or 
promoter fragment from the 5' flanking region upstream of the start site of the 
coding region of a mycobacterial secreted acid phosphatase gene [SEQ ID 
NO:9], [SEQ ID NO:11], [SEQ ID NO:13], [SEQ ID NO:15] wherein the 
promoter or promoter fragment is sufficient to control expression of a 
nucleotide sequence of interest and Is inducible under low-pH conditions. 
The invention further relates to a host cell transformed with an expression 
vector comprising an isolated DNA sequence which comprises a promoter or 
promoter fragment from the 5' flanking region upstream of the start site of the 
coding region of a secreted acid phosphatase gene selected from the group 
consisting of Mycobacterium tuberculosis [SEQ ID NO:9], Mycobacterium 
bovis [SEQ ID NO:11], Mycobacterium avium [SEQ ID NO:13], or 
Mycobacterium marinum [SEQ I D NO: 1 5] . 

The invention further relates to a host cell transformed with an expression 
vector comprising an isolated DNA sequence which comprises a promoter or 
promoter fragment which hybridizes to [SEQ ID NO: 1], [SEQ ID NO:2], [SEQ 
ID NO:3], [SEQ ID NO:4] under high stringency hybridization conditions and 
which is sufficient to control expression of a nucleotide sequence of interest 
and is inducible under low-pH conditions. 

Another aspect of the invention is a transcription cassette comprising a sapM 
promoter or a promoter fragment [SEQ ID NO 1:], [SEQ ID NO:2], [SEQ ID 
NO:3], [SEQ ID NO:4], a nucleotide sequence of interest operably linked to 
the sapM promoter or promoter fragment [SEQ ID NO :1], [SEQ ID NO:2], 
[SEQ ID NO:3], [SEQ ID NO:4] and a transcriptional temnination region. In 
one embodiment the transcription cassette further comprises a mycobacterial 
secreted acid phosphatase N-terminal signal sequence [SEQ ID NO:5], [SEQ 
ID NO:6], [SEQ ID NO:7], [SEQ ID NO:8] or a functional portton thereof. In 
another embodiment the transcription cassette further comprises the secreted 
acid phosphatase N-terminal signal sequence selected from the group 
consisting of Mycobacterium tuberculosis [SEQ ID NO:5], Mycobacterium 
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bovis [SEQ ID NO:6], Mycobacterium avium [SEQ ID NO:7], or 
Mycobacterium marinum [SEQ ID NO:8]. 

In yet another aspect of the invention there is a method for the diagnosis of a 
pathogenic mycobacterial Infection or a pathogenic fungal infection in a 
subject comprising obtaining a biological sample from a subject and analyzing 
the sample for the presence of antibodies specific to SapM [SEQ ID NO: 10], 
[SEQ ID NO:12], [SEQ ID NO:14], [SEQ ID NO:16] wherein detection of 
antibodies specific to SapM [SEQ ID NO:10], [SEQ ID NO:12]. [SEQ ID 
NO:14], [SEQ ID NO:16] is indicative of the presence of the pathogenic 
mycobacterial infection or the pathogenic fungal infection. 
In one aspect of the invention there is a method for the diagnosis of a 
pathogenic mycobacterial infection or a pathogenic fungal infection in a 
subject comprising obtaining a nucleic acid sample from a subject and 
analyzing the sample for the presence of nucleic acid encoding sapM [SEQ ID 
NO:9], [SEQ ID NO:11], [SEQ ID NO:13]. [SEQ ID NO:15l wherein detection 
ofsapM [SEQ ID NO:9]. [SEQ ID NO:11]. [SEQ ID NO:13]. [SEQ ID NO:15] 
is indicative of the presence of the pathogenic mycobacterial infection of the 
pathogenic fungal infection. In one embodiment the method for the diagnosis 
of a pathogenic mycobacterial infection or a pathogenic fungal infection 
further comprises the step of quantifying the amount of sapM [SEQ ID NO: 9], 
[SEQ ID NO:11]. [SEQ ID NO:13], [SEQ ID NO:15] detected. 
Another aspect of the invention is a method for the diagnosis of a pathogenic 
mycobacterial infection or a pathogenic fungal infection in a subject 
comprising obtaining a biological sample from a subject and analyzing the 
sample for the presence of SapM phosphatase activity wherein detection of 
SapM phosphatase activity is Indicative of the presence of the pathogenic 
mycobacterial infection or the pathogenic fungal infection. 
The invention also relates to a method of screening for a compound capable 
of modulating or regulating production of SapM [SEQ ID NO:10], [SEQ ID 
N0:12], [SEQ ID NO:14], [SEQ ID NO:16] comprising providing or employing 
a nucleic acid construct comprising the sapM promoter [SEQ ID NO:1], [SEQ 
ID NO:2], [SEQ ID NO:3], [SEQ ID NO:4] or a functional part thereof, the 
promoter or the functional part thereof being operably linked to a reporter 
gene capable of producing a detectable or a measurable signal, exposing the 
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construct to a candidate or test compound and detecting or measuring the 



produced in the presence pf the candidate or test compound as compared to 
in the absence of the test compound Indicates that the test compound is 
capable of modulating production of the mycobacterial secreted acid 
phosphatase. In one embodiment a plurality of compounds is tested. 
The invention also relates to a method of screening for a compound capable 
of modulating or regulating SapM phosphatase activity comprising incubating 
a mixture comprising SapM [SEQ ID NO:10], [SEQ ID NO:12], [SEQ ID 
NO:14], [SEQ ID NO:16] a SapM substrate, and the compound to be tested, 
and measuring the phosphatase activity, wherein a change in activity in the 
presence of the test compound as compared to in the absence of the test 
compound indicates that the test compound is capable of modulating the 
activity of the secreted acid phosphatase. In one embodiment a plurality of 
compounds is tested.. 

The invention further relates to a method of screening for a compound 
capable of regulating or modulating SapM [SEQ ID NO:10], [SEQ ID NO:12], 
[SEQ ID NO:14], [SEQ ID NO:16] secretion comprising exposing 
mycobacterial ciells to a compound to be tested, wherein the mycobacterial 
cells secrete SapM [SEQ ID NO:101, [SEQ ID NO:12], [SEQ ID NO:14], [SEQ 
ID NO:16] detecting the presence or activity of SapM [SEQ ID NO:10], [SEQ 
ID NO:12], [SEQ ID NO:14], [SEQ ID NO:16] secreted by the mycobacterial 
cells, and measuring the amount of the mycobacterial secreted acid 
phosphatase secreted by the mycobacterial cells, wherein a change in the 
secretion of the mycobacterial secreted acid phsophatase in the presence of 
the test compound as compared to in the absence of the test compound 
indicates that the test compound is capable of modulation of the secretion of 
the mycobacterial secreted phosphatase. In one embodiment a plurality of 
compounds is tested. 

The invention also relates to a kit for the detection of a pathogenic 
mycobacterial disease or infection comprising a SapM protein [SEQ ID 
NO:10]. [SEQ ID NO:12]. [SEQ ID NO:14], [SEQ ID NO:16] or polypeptide 
thereof, at least one SapM antibody, and one or more reagents necessary for 
detection of SapM specific antibodies. 



signal produced by the reporter gene, wherein a change in the signal 



6 



wo 2004/033677 



PCT/CA2003/001554 



The invention also relates to a kit for the detection of a pathogenic 
mycobacterial disease or infection comprising an oligonucleotide comprising 
contiguous nucleotides from the nucleic acid sequence that is complementary 
to the sequence of [SEQ ID NO:1], [SEQ ID NO:2], [SEQ ID NO:3], [SEQ ID 
NO:4], [SEQ ID NO:9], [SEQ ID NO:11]. [SEQ ID NO:13], [SEQ ID NO:15] 
and capable of specifically hybridizing to the complementary nucleotide 
sequence and reagents for hybridization of the oligonucleotide to a 
complementary nucleic acid sequence. 

In another aspect of the invention there is an antibody which is capable of 
specifically binding to a SapM protein [SEQ ID NO:10], [SEQ ID NO:12], [SEQ 
ID NO:14], [SEQ ID NO:16l or a polypeptide fragment thereof [SEQ ID 
NO:22]. 

In yet another aspect of the invention there is a vaccine or immunogenic 
composition for treatment or prophylaxis of a mammal against challenge by a 
mycobacterium comprising an antibody which is capable of specifically 
binding to a SapM protein [SEQ ID NO:10], [SEQ ID NO:12], [SEQ ID NO:14], 
[SEQ ID NO:16] or a polypeptide fragment thereof [SEQ ID NO:22], 
In a further aspect of the invention there is a vaccine or immunogenic 
composition for treatment or prophylaxis of a mammal against challenge by a 
mycobacterium comprising an isolated DNA sequence comprising a promoter 
or promoter fragment from the 5' flanking region upstream of the start site of 
the coding region of a mycobacterial secreted acid phosphatase gene [SEQ 
ID NO:1], [SEQ ID NO:2], [SEQ ID NO:3], [SEQ ID NO:4] wherein the 
promoter or promoter fragment is sufficient to control expression of a 
nucleotide sequence of interest and is inducible under low-pH conditions. 
Another aspect of the invention relates to a vaccine or immunogenic 
composition for treatment or prophylaxis of a mammal against challenge by a 
mycobacterium comprising an isolated DNA sequence comprising a promoter 
or promoter fragment which hybridizes to [SEQ ID NO:1], [SEQ ID NO:2], 
[SEQ ID NO:3], [SEQ ID NO:4] under high stringency hybridization conditions 
and which is sufficient to control expression of a nucleotide sequence of 
interest and is inducible under low pH conditions. 

The invention also relates to a vaccine or immunogenic composition for the 
treatment or prophylaxis of a mammal against challenge by a mycobacterium 
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comprising a SapM protein [SEQ ID NO:10], [SEQ ID NO:12], [SEQ ID 
NO:14], [SEQ ID NO:16] or a polypeptide fragment thereof [SEQ ID NO:22]. 
Another aspect of the invention relates to a vaccine or immunogenic 
composition for treatment or prophylaxis of a mammal against challenge by a 
mycobacterium comprising an isolated DNA sequence or sequence fragment 
which hybridizes to [SEQ ID NO:9], [SEQ ID NO:11], [SEQ ID NO:13], [SEQ 
ID NO:15] under high stringency hybridization conditions and which allows for 
expression of a SapM protein [SEQ ID NO:10]. [SEQ ID NO:12], [SEQ ID 
NO:14], [SEQ ID NO:"16] or a polypeptide fragment thereof [SEQ ID NO:22]. 
The invention further relates to an antigenic composition useful for the 
detection of a pathogenic mycobacterial disease or infection in a subject 
comprising a SapM polypeptide and substantially free from other proteins or 
glycoproteins with which it is natively admixed in a culture of pathogenic 
mycobacteria. 
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Detailed Description of the Drawings 

Preferred embodiments of the invention will be described in relation to the 
drawings in which: 

Figure 1. (A) Acid phosphatase activity in the culture supematant of M. bovis 
BCG in response to nutrient stress. BCG strains, in triplicate, were grown to 
early stationary phase in standard, asparagine-containing Sauton medium, 
low phosphate Sauton, low glycerol Sauton, or ammonium-containing Sauton, 
and the acid phosphatase activity in the culture supernatants were determined 
as described under "Experimental Procedures". (B) SapM expression in M. 
bovis BCG grown in Sauton media. Equal amounts of protein (6 |xg) were 
loaded in each lane and visualized by silver staining (top panel) or by Western 
blotting with SapM antisera (lower panel). Lanes 2, 4, 6, and 8 are from 
culture supematant of BCG-Frappier, -Pasteur, - Birkhaug, and -Japan grown 
in nomnal Sauton medium, respectively. Lanes 3, 5. 7, and 9 are the 
corresponding strains grown in ammonium-containing Sauton medium. Lane 
10 is partially purified SapM from mc^-155/pSAP, and lane 1 is molecular 
weight marker. 

Figure 2. pl-l-dependent expression of SapM in M. bows BCG-Birkhaug. 

(A) Acid phosphatase activity determined in CFP fractions of BCG-Birkhaug 
cultures grown in Sauton medium In which the pH was adjusted to indicated 
levels by appropriate buffers. Results are from triplicate cultures. (B) The 
same CFP fractions as in panel (A) were subjected to SDS-PAGE analysis. 
Equal amounts of protein (7 |j.g) were loaded in each lane and visualized by 
silver staining (top panel) or by Western blotting with SapM antisera (lower 
panel). Partially purified SapM, indicated by the arrow, was included as a 
control. (C) Cell surface-associated acid phosphatase (open bars) and 
alkaline phosphatase (filled bars) activity from BCG-Birkhaug. The activities 
are detected from equal numbers of cell as described under "Experimental 
Procedures". 

Figure 3. Cloning and expression of M. tuberculosis sapM in M. 
smegmatis. (A) Construction of pSAP. A 3 kb Nliel fragment of the M. 
tuberculosis genome (BAC403) was ligated to Xbal-linearized pMD31 . The 
resulting plasmid, pSAP, contains upp, sapM (Rv3310) and sapC (Rv3311). 
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Restriction enzyme sites: B Bgl II. H H/ndlll. K Kpnl. V EcoRV, and XXbal. 
(B) Expression of sapM in M. smegmatis. Cells of M. smegmatis were grown 
in normal asparagine-containing Sauton {lanes 1 and 3) or ammonium- 
containing Sauton media {lanes 2 and 4), and the CFP fractions were 
subjected to SDS-PAGE and visualized by silver staining (top panel) or by 
Western blotting with SapM antisera (lower panel). Lanes 1 , 2: mc^- 
155/pSAP, lanes 3, 4: mc^-155/pMD31. Arrow indicates SapM. 
Figure 4. pH-induced expression of M. tuberculosis sapM. (A) M. 
smegmatis recombinant strain mc^-155/pSAP (filled bars) and control strain 
mc^-155/pMD31 (open bars) were grown in Sauton media adjusted to the 
indicated pH and the acid phosphatase activity in the CFP fractions were 
determined. (B & C) The same CFP fractions from panel A were subjected to 
SDS-PAGE analysis and visualized by (B) silver staining or (C) Westem 
blotting. The top panel is from mc^-155/pMD31 and the lower panel from m<^- 
155/pSAP. Partially purified SapM, indicated by arrow, was included as a 
control. 

Figure 5. (A) Construction of GFP-fusion vectors. A 2.5 kb EcoRV fragment of 
pSAP was ligated to EcoRV-linearized GFP-fusion vector pFPV27. The 
resulting pSAPC-GFP contains the entire sapM gene and a truncated sapC 
gene transcriptionaly fused to gfp. Vector pSAPC-GFP was then cut with 
Bg/ll. The 3.8 kb and 2.5 kb fragments, comprising sapM, gfp and the vector 
backbone, were re-ligated. The resulting pSAPM-GFP contains a truncated 
sapM gene transcriptionaly fused to gfp. (B) J774 cells were infected with M. 
marinum bearing pSAPM-GFP (panels A-C), pSAPC-GFP (panels D-F), or 
cloning vector pFPV27 (panels G-H) and examined by microscope. M. 
marinum cells were grown in Sauton medium (pH 7.4) and J774 cells were 
infected with the bacteria for 24 hrs before analysis. Images were collected for 
each view in the DiC mode (panels A, D, and G), in the red fluorescence 
channel for Texas-red phalloidin (panels B, E, and H), and in the green 
fluorescence channel (panels C, F, and I). The intracellular bacteria are 
designated by arrows and extracellular bacteria are designated by 
arrowheads. 

Figure 6. (A) Alignment of SapM homologs found in the genome database of 
M. tuberculosis, M. bows, M. avium and M. marinum. Arrow indicates putative 
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signal sequence cleavage site. (B) Southern bolt analysis. Plasmid pSAP, and 
chromosomal DNA isolated from in M. bovis BCG, M. smegmatis, and M. 



transferred to a nylon membrane, and hybridized with a radiolabeled sapM 
probe. 

Figure 7. Reactivity of human sera with partially purified SapM from mc^- 
155/pSAP. Partially purified SapM was run on SDS-PAGE and transferred to 
nitrocellulose membranes. Individual lanes were cut and probed with a 1:150 
dilution of human sera. Lanes: 1 , rabbit SapM antisera generated against a 
synthetic peptide as described under " Experimental Procedures"; 2. sera 
from a healthy individual with a positive PPD test; 3 and 6, healthy individuals 
with unknown PPD reactivity; 4, patient with lymph node tuberculosis who has 
been on chemotherapy for 4 months; 5, patient with evident M. avium in 
sputum; 7, healthy Individual who was vaccinated with BCG in 1977; 8, sera 
from a healthy individual who was vaccinated with BCG in 1953. 



chelonae were digested with EcoRI or EcoRV . separated on an agarose gel. 
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Detailed Description 

The present invention will now be described more fully with reference to the 
accompanying drawings, in which preferred embodiments of the invention are 
shown. This invention may, however, be embodied in different forms and 
should not be construed as limited to the embodiments set forth herein. 
Rather, these embodiments are provided so that this disclosure will be 
thorough and complete, and will fully convey the scope of the invention to 
those skilled in the art. 

The term "antibody" as used herein, is meant to include both the native 
antibody, and biologically active derivatives of antibodies, such as, for 
example, Fab', F(ab').sub.2 or Fv as well as single-domain and single-chain 
antibodies. A biologically active derivative of an antibody retains the ability to 
bind antigen. 

The tenm "isolated DNA sequence" as described herein includes DNA whether 
single or double stranded. The sequence is isolated and/or purified (i.e. from 
its natural environment), in substantially pure or homogeneous form, free or 
substantially free of nucleic acid or genes of the species of interest or origin 
other than the promoter or promoter fragment sequence. The DNA sequence 
according to the present invention may be wholly or partially synthetic. The 
term "isolated" encompasses all these possibilities. 

The temi "low-pH conditions" as used herein refers to mildly acidic conditions 
with a pH range of 5.0 to 7.0, more preferably 5.8 to 6.6, and most preferably 
6.2. 

The tenm "oligonucleotide" as described herein refers to a probe or primer that 
is single-stranded DNA or RNA, or analogs thereof, that has a sequence of 
nucleotides that includes at least 14, preferably at least 20, more preferably at 
least 50, contiguous bases that are the same as (or the complement of) any 
14 or more contiguous bases set forth in any of SEQ ID NO:1 to SEQ ID 
NO:4, [SEQ ID NO:9], [SEQ ID NO:11], [SEQ ID NO:13], [SEQ ID NO:15]. 
Preferred regions from which to construct probes include 5' and/or 3' coding 
regions of SEQ ID NO: 9. [SEQ ID NO:11], [SEQ ID NO:13]. [SEQ ID NO:15] . 
In addition, the entire cDNA encoding region of SEQ ID NO:18 , or the entire 
sequence corresponding to SEQ ID NO: 20, may be used as a probe. Probes 
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may be labeled by methods well-known in the art, as described hereinafter, 
and used in various diagnostic kits. 

The term "operably linked" as described herein means joined as part of the 
same nucleic acid molecule, suitably positioned and oriented for transcription 
to be initiated from the same promoter. 

The term "promoter" as described herein refers to a sequence of nucleotides 

from which transcription may be initiated on DNA operably linked downstream 

(i.e. in the 3* direction on the sense strand of double-stranded DNA). 

The term "promoter activity" as described herein refers to the ability to initiate 

transcription. 

The tenrn "transcription cassette" as described herein refers to a nucleic acid 
sequence encoding a nucleic acid that is transcribed. To facilitate 
transcription, nucleic acid elements such as promoters and enhancers and 
transcriptional termination sequences are typically included in the transcription 
cassette. 

We have found that SapM Is an important determinant of mycobacterial 
pathogenesis. 

In contrast to many bacterial acid phosphatases, the expression of SapM is 
not regulated by environmental inorganic phosphate concentration. Instead, it 
is regulated by pH. Pathogenic mycobacteria produce high levels of SapM 
under mildly acidic pH, and expression is maximal at pH 6.2. The sapM gene 
is present in pathogenic mycobacteria only. The sapM gene Is selectively 
expressed in host macrophages, and is important for the virulence and 
pathogenesis of mycobacteria including M. tuberculosis. The expression of 
M, tuberculosis sapM is induced by the intracellular pH, and this gene Is 
specific to pathogenic mycobacteria. SapM possess unique enzymatic 
activity and is an important determinant of mycobacterial pathogenesis. Unlike 
other bacterial acid phosphatases, SapM exhibits unusually high activity 
against GTP, which may affect the cellular GTP content and subsequently the 
balance of active to inactive forms of Rab proteins. This is particulariy relevant 
since SapM is a small protein (28 kD) secreted extracellulariy and is likely to 
gain access to the cytosol of the macrophages. The importance of SapM 
function in mycobacterial pathogenesis is further illustrated by the remarkable 
finding that SapM is present in pathogenic mycobacteria but not in 
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nonpathogenic mycobacteria. SapM homologs are found in M. tuberculosis, 
M. bovis, M. bovis BCG, M. avium and M. marinum, all of which are 
pathogenic. In contrast, neither the SaplVI protein nor the sapM gene is 
detected in nonpathogenic, environmental mycobacteria such as M. 
smegmatis and M. chelonae. 

Promoter Fragment 

The present invention provides an isolated DNA sequence comprising a 
promoter or promoter fragment, the promoter or promoter fragment 
comprising a nucleotide sequence as provided in [SEQ ID NO:1], [SEQ ID 
NO:2], [SEQ ID NO:3], [SEQ ID NO:4]. The promoter Or promoter fragment 
may comprise one or more fragments of the nucleotide sequence provided in 
[SEQ ID NO:1], [SEQ ID NO:2], [SEQ ID NO:3], [SEQ ID NO:4] sufficient to 
control expression of a nucleotide sequence of interest and which is inducible 
under low-pH conditions. The promoter or promoter fragment may comprise a 
sequence of nucleotides 5' to position 498 as shown in [SEQ ID NO:1] in 
Mycobacterium tuberculosis or an equivalent sequence in other pathogenic 
mycobacterium or pathogenic fungi. 

Restriction enzyme or nucleases may be used to digest the nucleic acid 
followed by an appropriate assay to determine the minimal nucleotide 
sequence required for promoter activity. 

The pronrloter or promoter fragment may comprise one or more sequence 
motifs or elements conferring developmental and/or tissue-specific regulatory 
control of expression or regulatory control by ah exogenous substrate or 
environmental condition. For example, the promoter or promoter fragment 
may comprise a lung-specific regulatory control element or an element that is 
regulated by an exogenously added sugar, such as isopropyl-beta-D- 
thiogalactopyranoside. 

The present Invention extends to a promoter or promoter fragment which has 
a nucleotide sequence which is an allele, mutant, variant or derivative, by way 
of nucleotide addition, insertion, substitution or deletion. Alteration to the 
nucleotide sequence may be perfomried by techniques known to those skilled 
in the art. One or more alterations to a promoter or promoter fragment 
sequence or fragment of the present invention may increase or decrease 
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promoter activity, or decrease the magnitude of the effect of a compound or 
protein cable of modulating the activity of the promoter or the promoter 
fragment 

Expression Vector 

The present invention also provides an expression vector comprising a 
promoter or promoter fragment as disclosed herein. The promoter or 
promoter fragments can be cloned into a variety of vectors by means that are 
well known in the art. Such a vector may comprise a suitably positioned 
restriction site or other means for insertion of a nucleic acid sequence of 
interest into the vector that is operably linked to the promoter or promoter 
fragment of the current invention. Such a vector may comprise a suitably 
positioned restriction site or other means for insertion of a nucleotide 
sequence of interest into the vector that Is operably linked to the promoter or 
promoter fragment. 

Suitable vectors can be chosen or constructed containing appropriate 
regulatory sequences, including the promoter sequences or fragments 
thereof, terminator fragments, polyadenylation sequences, enhancer 
sequences, marker genes and other sequences as appropriate. 
For use in an assay or experiment, commercially available vectors such as 
the pET-series (Stratagene), pPRO-series (Clontech), pTet-series (Clontech), 
BacPAK systems (Clontech) may be employed. For use in gene therapy, 
vectors such as pT-Rex. pIND, pcDNA, pVAX1, and pEF may be employed. 
Expression vectors are useful to provide high levels of polypeptide 
expression. Cell cultures transformed with the DNA sequences of the current 
Invention are useful as research tools particularly for studies of the effects of 
SapM on phagosome trafficking and maturation. Cell cultures may be used in 
overexpression and research according to numerous techniques known in the 
art. For example, a cell line (either an immortalized cell culture or a primary 
cell culture) may be transfected with a vector containing a transcription 
cassette of the current invention which permits the assessment of the levels of 
nucleic acid of interest produced, the polypeptide, the functionality and the 
phenotype of the cells produced. 
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For further details see, for example, Molecular Cloning: a Laboratory Manual: 
2"*^ edition, Sambrook et aL, 1989 Cold Spring Harbor Laboratory Press. 
Procedures for introducing DNA.into cells depends on the host used, but are 
well known, such as electroporation and calcium phosphate transformation 
methods- 
Host Cell 

A further aspect of the present invention provides a host cell containing a 
transcription cassette of the current invention. Particularly desirable host cells 
include M. tuberculosis, M.marinum, M.bovis, M.bovis BCG, M. smegmatis, 
members of the M.avium complex, E.coli (and derivatives such as E. coli 
BL21(DE3) ), insect cell lines*, such as S^1. for protein overexpression, and 
mammalian cell lines, such as RAW, and J774. Examples of suitable genes 
to be expressed under control of the sapM promoter are: antigen 85 genes 
(fbpA, fbpB, fbpC), 19-Kd lipoprotein gene (Ippo) and alpha-crystallin gene 
(acr). 

Methods known in the art for transformation, include, but are not limited to, 
electroporation, rubidium chloride, calcium chloride ,calcium phosphate or 
chloroquine transfection, viral infection, phage transduction and 
microinjection, and the use of cationic lipid and lipid/amino acid complexes, or 
of liposomes, or a large variety of other commercially available, and readily 
synthesized transfection adjuvants, are useful to transfer a SapM nucleic acid 
molecule into host cells. 

Host cells are cultured in conventional nutrient media. The media may be 
modified as appropriate for inducing promoters, amplifying nucleic acid 
sequences of interest or selecting transformants. The culture conditions, such 
as temperature, composition and pH will be apparent. After transformation, 
transformants may be identified on the basis of selectable phenotype. 

Method of Diagnosing 

Detection of sapM nucleic acid, SapM protein, or SapM phosphatase activity 
is useful as a screening tool for the presence of a pathogenic mycobacterial or 
a pathogenic fungal infection or to monitor the progression of a pathogenic 
mycobacterial or a pathogenic fungal infection. 
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The present invention relates to a method for the diagnosis of a pathogenic 
mycobacterial infection or a pathogenic fungal infection in a subject 
comprising: 

a) obtaining a biological sample from a subject; and 

b) analyzing the sample for the presence of antibodies specific to SapM 
[SEQ ID NO:10]. [SEQ ID NO:12];[SEQ ID NO:14], [SEQ ID NO:16] 
wherein detection of antibodies specific to SapM [SEQ ID NO:10], 
[SEQ ID NO:12]. [SEQ ID NO:14], [SEQ ID NO:16] is indicative of the 
presence of the pathogenic mycobacterial or pathogenic fungal 
infection. 

The sample may be analyzed for the presence of antibodies specific to SapM 
by using SapM in immunoassays wherein SapM can be utilized in liquid 
phase or bound to a solid phase carrier. In addition, SapM can be detectably 
labeled in various ways for use in immunoassays with anti-SapM antibodies. 
The prefenred Immunoassays for detecting anti-SapM antibodies using the 
methods of this invention include radioimmuno-assays, enzyme-linked 
immunosorbent assays (ELISA), or other assays known in the art, such as 
immunofluorescent assays, chemiluminescent assays, or bioluminescent 
assays. 

The present invention relates to a method for the diagnosis of a pathogenic 
mycobacterial infection or a pathogenic fungal infection in a subject 
comprising: 

a) obtaining a nucleic acid sample from a subject; and 

b) analyzing the sample for the presence of nucleic acid encoding SapM, 
wherein detection of sapM is indicative of the presence of the 
pathogenic mycobacterial or pathogenic fungal infection. 

The sample may be analyzed for the presence of nucleic acid encoding SapM 
by, for example, PGR analysis, DNA sequencing, SSCP analysis, or RFLP 
analysis. 

In another embodiment the present invention relates to a method for the 
diagnosis of a pathogenic mycobacterial infection or a pathogenic fungal 
infection in a subject comprising: 

a) obtaining a biological fluid sample from a subject; and 
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b) analyzing the sample for the presence of SapM phosjDhatase activity, 
wherein detection of SapM phosphatase activity is indicative of the 
presence of the pathogenic mycobacterial or pathogenic fungal 
infection. 



by performing the acid phosphatase assay using GTP and/or NADPH as 
cofactors and p-nitrophenylphosphate as substrate. 
Measurement of sapM nucleic acid, SapM protein, or SapM phosphatase 
activity may be used not only for the diagnosis of a pathogenic mycobacterial 
or a pathogenic fungal infection but also to monitor therapeutic response, 
assess prognosis, assess patient disease risk and to monitor the success of 
disease preventative interventions In patients at risk. 

Other assays (as well as variations of the above assay) will be apparent from 
the description of this invention and techniques. 

Method of Screening 

The present invention also relates to a method of screening a synthetic 
compound or protein that regulates or modulates promoter activity of the 
promoter or promoter fragment DNA of the present invention, A method of 
identifying a synthetic compound or protein that regulates or modulates the 
promoter activity of the SapM promoter or promoter fragment comprises: 

a) employing or providing a nucleic acid construct comprising a functional 
part of the SapM promoter [SEQ ID NO:1], [SEQ ID NO:2], [SEQ ID 
N0:3], [SEQ ID NO:4] the functional part of the promoter being 
operably linked to a reporter gene capable of producing a detectable 
signal; 

b) exposing the nucleic acid construct to candidate compounds or 
proteins; and 

c) comparing the signal produced in the absence of the compound or 
protein tested. 

The reporter gene preferably encodes an enzyme which catalyses a reaction 
that produces a detectable signal. Many examples are known, for example, 
gfp encoding the green fluorescent protein (and gfp variants encoding 
enhanced green, blue/cyan and yellow fluorescent proteins), DsRed encoding 



The sample may be analyzed for the presence of SapM phosphatase activity 



18 



wo 2004/033677 



PCT/CA2003/001554 



the red fluorescent protein, lacZ encoding Beta-galactosidase, gus encoding 
Beta-glucoronidase, luxAB {or lucCDABE) encoding bacterial luciferase, luc 
encoding firefly luciferase, and phoA encoding alkaline phosphatase. 
Those skilled in the art are well aware of a multitude of possible reporter 
genes and assay techniques that may be used to determine gene activity. 
Any suitable reporter/assay may be used and it should be appreciated that no 
particular choice is essential to or a limitation of the present invention. 
The reporter gene may be used in an in vitro or an in vivo expression system. 
In addition to a promoter or a promoter fragment, expression generally 
requires the presence of a translational initiation region and transcriptional 
and translational termination regions. . 

Another embodiment of the current invention relates to a method of screening 
a synthetic compound or protein that regulates or modulates 3apM 
phosphatase activity comprising: 

a) incubating a mixture comprising SapM [SEQ ID NO:10], [SEQ ID 
NO:12], [SEQ ID NO:14], [SEQ ID NO:16], a SapM substrate, and the 
compound to be tested; 

b) measuring the phosphatase activity; and 

c) comparing the phosphatase activity in the absence of the compound or 
protein to be tested. 

The substrate may comprise pNPP, GTP or NADPH. 

Another embodiment of the current invention relates to a method of screening 
a synthetic compound or protein that regulates or modulates SapM secretion 
comprisinig: 

a) exposing mycobacterium cells to a compound or protein to be tested, 
wherein the mycobacterium cells secrete SapM [SEQ ID NO:10]; [SEQ 
ID NO:12], [SEQ ID NO:14], [SEQ ID NO:16]: 

b) detecting the presence or activity of SapM [SEQ ID NO:10], [SEQ ID 
NO:12], [SEQ ID NO:14]. [SEQ ID NO:16] secreted by the 
mycobacterium cells; and 

c) comparing the secretion of SapM [SEQ ID NO:10]. [SEQ ID NO:12], 
[SEQ ID NO:14], [SEQ ID NO:16l in the absence of the coinpound or 
protein to be tested. 
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"Modulation", "modulates" and "modulating" can refer to an increase In 
expression or activity, a decrease in expression or activity, a cliange in the 
type or kind of expression or activity present, a complete cessation of 
expression or activity (I.e., an absence of expression or activity), or the 
instigation of expression or activity. Suitable compounds that may be used 
include but are not limited to proteins, nucleic acids, small molecules, 
hormones, antibodies, peptides, antigens, cytokines, growth factors, 
pharmacological agents including chemotherapeutics, carcinogenics, or other 
cells (i.e. cell-cell contacts). Cells may also be screened for the effects of 
environmental or physiological factors such as radiation, action potentials, etc. 
on normal gene expression. 

Other assays (as well as variations of the above assay) will be apparent from 
the description of this invention and techniques such as those disclosed in 
U.S. Patent Nos. 5,851,788, 5,736,337and 5,767,075 which are incorporated 
by reference in their entirety. For example, the test compound levels may be 
either fixed or increased, and a plurality of compounds or proteins may be 
tested at a single time. 

Kits 

The invention includes a kit for detecting the presence of SapM nucleic acid 
mplecule, comprising at least one probe of the invention. Kits may be 
prepared according to known techniques, for example, see patent nos. 
5,851 ,788 and 5,750.653. The kit preferably includes reagents suitable for 
the hybridization of the probe to a complementary nucleic acid sequence. 
The invention also includes a kit for detecting the presence of SapM protein, 
comprising at least one antl-SapM antibody of the invention. Kits may be 
prepared according to known techniques, for example, see patent nos. 
5,851 .788 and 5,750,653. 

The kit preferably includes an antibody, a medium suitable for the formation of 
an immunological complex between the antibody and a polypeptide 
recognized by the antibody and a reagent capable of detecting the 
immunological complex to ascertain the presence of SapM or a similar 
polypeptide in a biological sample. Further background on the use of 
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antibodies is provided, for example in U.S. Patent Nos. 5,695,931 and 
5,837,472 which are incorporated by reference in their entirety. 

Preparation of Antibodies 

The invention Includes an isolated antibody immunoreactive with a 
polypeptide of the invention. Antibodies are preferably generated against 
epitopes of native SapM [SEQ ID NO:10], [SEQ ID NO:12]," [SEQ ID NO:14], 
[SEQ ID NO:16] or synthetic peptides of SapM [SEQ ID NO:10], [SEQ ID 
NO:12], [SEQ ID NO:14], [SEQ ID NO:16] . The antibody may be labeled with 
a detectable marker or unlabeled. The antibody is preferably a monoclonal 
antibody or a polyclonal antibody. SapM antibodies can be employed to 
screen organisms containing SapM polypeptides. The antibodies are also 
valuable for immuno-puriflcatlon of polypeptides from crude extracts. For 
example, one may contact a biological sample with the antibody under 
conditions allowing the fonmatlbn of an Immunological complex between the 
antibody and a polypeptide recognized by the antibody and detecting the 
presence or absence of the Immunological complex whereby the presence of 
SapM or a similar polypeptide is detected in the sample. The invention also 
includes compositions, preferably including the antibody, a medium suitable 
for the formation of an immunological complex between the antibody and a 
polypeptide recognized by the antibody and a reagent capable of detecting 
the immunolgical complex to ascertain the presence of SapM or a similar 
polypeptide. 

To recognize SapM, one may generate antibodies against a range of unique 
epitopes throughout the molecule, for example [SEQ ID NO:22] 
(NDMHDGSI). One could generate antibodies that target the N-termlnal 
signal sequence, to block the secretion of SapM. In addition, these 
antibodies, or other antibodies directed against other SapM epitopes, could 
block SapM activity by leading to enhanced SapM clearance/degradation, with 
a concomitant decrease in SapM phosphatase activity. 
Monoclonal and polyclonal antibodies are prepared according to the 
description in this application and techniques known in the art. For examples 
of methods of the preparation and uses of monoclonal antibodies, see U.S. 
Patent Nos. 5,688,681 , 5,688,657, 5,683,693, 5,667,781, 5,665,356, 
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5,591,628. 5,510,241, 5.503,987, 5,501,988. 5,500.345 and 5,496,705 that 
are incorporated by reference in their entirety. Examples of the preparation 
and uses of polyclonal antibodies are disclosed in U.S. Patent Nos. 
5,512,282. 4,828,985, 5,225,331 and 5,124.147 which are incorporated by 
reference in their entirety. 

The invention also includes methods of using the antibodies. For example, 
the invention includes a method for detecting the presence of a SapM 
polypeptide such as [SEQ ID NO:10], [SEQ ID NO:12]. [SEQ ID NO:14], 
[SEQ ID NO:16] or [SEQ ID NO:22] by: a) contacting a sample containing 
one or more polypeptides with an antibody of the invention under conditions 
suitable for the binding of the antibody to polypeptides with which it is 
specifically reactive; b) separating unbound polypeptides from the antibody; 
and c) detecting antibody which remains bound to one or more of the 
polypeptides in the sample. 

Immunogenic Compositions 

One skilled in the art will appreciate that suitable methods of administering the 
immunogenic compositions of the present invention are available. Preferably, 
the compositions are administered parenterally, e.g., intravenously, 
intraarterially, intrathecally, subcutaneously, intradermally or intramusculariy. 
The requirements for effective pharmaceutical carriers for parenteral 
compositions are well known to those of ordinary skill In the art (see, e.g. 
Banker and Chalmers (eds.). Pharmaceutics and Pharmacy Practice, J.B. 
Lippincott Company. Philadelphia, PA. (1982) and Toissel. ASIHIP Handbool< 
on Injectable Drugs (4^ ed.)). Such solutions can contain anti-oxidants. 
buffers, bacteriostats, and solutes that render the fonmulations Isotonic with 
the blood of the intended recipient, and aqueous and non-aqueous sterile 
suspensions that can include suspending agents, solubilizers. thickening 
agents, stabilizers, and preservatives. Compounds may be administered in a 
physiologically acceptable diluent in pharmaceutical carrier, such as sterile 
liquid or mixture of liquids, including water, saline, aqueous dextrose and 
related sugar solutions, an alcohol, such as ethanol, isopropanol, or 
hexadecyl alchohol, glycols such as propylene glycol or polyethylene glycol, 
dimethylsulfoxide. glycerol ketals such as 2,2-dimethyH ,3-dioxolane-4- 
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methanol, ethers, such as poly(ethyleneglycx>l) 400, an oil, a fatty acid, a fatty 
acid ester or glyceride or an acetylated fatty acid glyceride with or without the 
addition of a pharmaceutically acceptable surfactant, such as soap or a 
detergent, suspending agent, such as pectin, carbomers, methyl cellulose, 
hydroxypropylmethylceliulose or caboxymethylcellulose or emulsifying agents 
and other pharmaceutical adjuvants. 

Oils useful in parenteral formulations include petroleum, animal, vegetable, or 
synthetic oils. Specific examples of oils useful in such formulations include 
peanut, soybean, cottonseed, corn, olive, petrolatum, and mineral. Suitable 
fatty acids for use in parenteral formulations include oleic acid, stearic acid, 
and isostearic acid. Ethyl oleate and isopropyl myristate are examples of 
suitable fatty acid esters. 

Suitable soaps for use in parenteral formulations include fatty alkali metal, 
ammonium, and triethanolamine salts, and suitable detergents including, 
cationic detergents, anionic detergents, nonionic detergents, amphoteric 
detergents and mixtures thereof. 

The parenteral formulations typically will contain from about 0.5 to about 25% 
by weight of the active ingredient in solutions. Preservatives and buffers may 
be used. Parenteral formulations can be presented in unit-dose or multi-dose 
sealed containers and can be lyophilized. Extemporaneous injection solutions 
and suspensions can be prepared from sterile powders, granules, and tablets. 
Topical formulations, including those that are useful for transdermal drug 
release, are well-known to those of skill in the art and are suitable in the 
context of the present invention for application to the skin. 
Formulations suitable for oral administration can consist of liquid solutions, 
capsules, powders, suspensions and suitable emulsions. Liquid formulations 
may include diluents, such as water and alcohol. Capsule forms can be of the 
ordinary hard- or soft-shelled gelatin type containing, for example, surfactants, 
lubricants, and inert fillers such as lactose, sucrose, calcium phosphate, and 
cornstarch. Tablet forms can include excipients, colorants, diluents, buffering 
agents, disintegrating agents, moistening agents, preservatives, flavoring 
agents and pharmacologically compatible excipients. 
The immunogenic compositions of the current invention can be made into 
aerosol formulations to be administered by inhalation. Formulations suitable 
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for aerosol administration may include surfactants and propellant. A carrier 
may also be included for intranasal delivery. 

The compositions of the present invention may also be administered as 
suppositories by mixing with a variety of bases, such as emulsifying bases or 
water-soluble bases. Formulations suitable for vaginal administration may be 
presented as pessaries, tampons, creams, gels, pastes, foams or spray 
formulas. 

The concentration of SapM protein or polypeptide fragment thereof for 
immunogenic compositions can vary widely, i.e., from less than about 1% 
usually at or at least about 10% to as much as 20 to 50% or more by weight 
and will be selected primarily be fluid volumes, viscosities, etc., in accordance 
with the particular mode of administration selected. Actual methods for 
preparing parenterally administrable compositions will be know or apparent to 
those skilled in the art and are described in more detail, for example, 
Remington's Phannaceutical Science (17^ ed.. Mack Publishing company, 
Easton, PA, (1985). 

The compounds of the present invention may be formulated as inclusion 
complexes or liposomes. Liposomes serve to target the SapM protein or 
polypeptide fragment thereof peptides to a particular tissue. Liposomes can 
also increase the half-life of the compositions. 

Vaccine Preparation and Use 

Immunogenic compositions, suitable to be used as vaccines, may be 
prepared from SapM protein and polypeptide fragments thereof and nucleic 
acid molecules encoding SapM proteins and polypeptide fragments thereof. 
For examples of methods of the preparation and uses of vaccines, see U.S. 
Patent Nos. 4,601,903. 4.599,231, 4,599,230, and 4,596,792 that are 
incorporated by reference in their entirety. 

Immunogenic compositions including vaccines may be prepared as 
Injectables, either as liquid solutions or suspensions; solid forms suitable for 
solution in, or suspension in, liquid prior to injection may also be prepared. 
The preparation may also be emulsified, or the protein encapsulated in 
liposomes. The live immunogenic ingredients are often mixed with excipients 
that are pharmaceutically acceptable and compatible with the active 
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ingredient. Suitable excipients are, for example, water, saline, dextrose, 
glycerol, ethanol, or the like and combinations thereof. In addition, if desired, 
the vaccine may contain minor amounts of auxiliary substances such as 
wetting or emulsifying agents, pH buffering agents, and/or adjuvants that 
enhance the effectiveness of the vaccine. Examples of adjuvants which may 
be effective include but are not limited to: aluminum hydroxide, N-acetyl- 
muramyl-L-threonyl-D-isoglutamlne (thr-MDP), N-acetyl-nor-mu.ramyl-L-alanyl- 
D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylmuramyl-L- 
alanyl-D-isoglutaminyl-L-alanine-2-(1'-2-dipalmitoyl-sn -glycero-3- 
hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), 
and RIBI, which contains three components extracted from bacteria, 
monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton 
(MPL+TDM+CWS) in a 2% squalene/Tween 80™ emulsion. 
The effectiveness of an adjuvant may be determined by measuring the 
amount of antibodies directed against an immunogenic polypeptide containing 
a SapM antigenic sequence resulting from administration of the SapM 
vaccines that are also comprised of the various adjuvants. 
The vaccines are conventionally administered parenterally, by injection, for 
example, either subcutaneously or intramusculariy. Additional formulations 
which are suitable for other modes of administration include suppositories 
and, in some cases, oral formulations. For suppositories, traditional binders 
and carriers may include, for example, polyalkylene glycols or triglycerides; 
such suppositories may be formed from mixtures containing the active 
ingredient in the range of 0.5% to 10%, preferably 1%-2%. Oral formulations 
include such normally employed excipients as, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, 
magnesium carbonate, and the like. These compositions take the form of 
solutions, suspensions, tablets, pills, capsules, sustained release formulations 
or powders and contain 10%-95% of active ingredient, preferably 25%-70%. 
The vaccines are administered in a manner compatible with the dosage 
formulation, and in such amount as will be prophylactically and/or 
therapeutically effective. 

The vaccine may be given in a single dose schedule, or preferably in a 
multiple dose schedule. A multiple dose schedule is one in which a primary 
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course of vaccination may be with 1-10 separate doses, followed by other 
doses given at subsequent time intervals required to maintain and or reinforce 
the immune response, for example, at 1-4 months for a second dose, and if 
needed, a subsequent dose(s) after several months. The dosage regimen will 
also, at least in part, be determined by the need of the individual and be 
dependent upon the judgment of the practitioner. 
In addition, the vaccines may be administered in conjunction with other 
immunoregulatory agents, for example, immune globulins. 

Nucleic Acid Molecules 

Functionally equivalent nucleic acid molecule or polypeptide sequence 

The term "isolated DNA sequence" refers to a DNA sequence the structure of . 
which is not identical to that of any naturally occurring DNA sequence or to 
that of any fragment of a naturally occurring DNA sequence spanning more 
than three separate genes. The term therefore covers, for example, (a) DNA 
which has the sequence of part of a naturally occurring genomic DNA 
molecule; (b) a DNA sequence incorporated into a vector or into the genomic 
DNA of a prokaryote or eul^aryote, respectively, in a manner such that the 
resulting molecule is not identical to any naturally occurring vector or genomic 
DNA; (c) a separate molecule such as cDNA, a genomic fragment, a 
fragment produced by reverse transcription of poIyA RNA which can be 
amplified by PGR, or a restriction fragment; and (c) a recombinant DNA 
sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein. 
Specifically excluded from this definition are nucleic acids present in mixtures 
of (i) DNA molecules, (li) transfected cells, and (iii) cell clones, for 
example, as these occur in a DNA library such as a cDNA or genomic DNA 
library. 

Modifications in the DNA sequence, which result in production of a chemically 
equivalent or chemically similar amino acid sequence, are included within the 
scope of the invention. Modifications include substitution, insertion or deletion 
of nucleotides or altering the relative positions or order of nucleotides. 
Variants of the polypeptides of the invention may occur naturally by mutation 
for example, or may be made with polypeptide engineering techniques such 
as site directed mutagenesis for example, whicii are well known in the art for 
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substitution of amino acids. For example, a hydrophobic residue such as 
alanine may be substituted with a more hydrophobic residue such as leucine, 
valine or isoleucine. A negatively charged amino acid such as aspartic acid 
may be substituted for glutamic acid. A positively charged amino acid such as 
lysine may be substituted for another positively charged amino acid such as 
arginine- 

Therefore, the invention includes polypeptides having conservative changes 
or substitutions in amino acid sequences. Conservative substitutions insert 
one or more amino acids, which have similar chemical properties as the 
replaced amino acids. The invention includes sequences where conservative 
substitutions are made that do not destroy SapM activity. 
The invention also includes polypeptide fragments of the polypeptides of the 
invention that may be used to confer SapM activity if the fragments retain 
activity. The invention also includes polypeptide fragments of the 
polypeptides of the invention which may be used as a research tool to 
characterize the polypeptide or its activity. Such polypeptides preferably 
consist of at least 5 amino acids. In preferred embodiments, they may consist 
of 6 to 10, 1 1 to 15, 16 to 25, 26 to 50, 51 to 75,76 to 100 or 101 to 250 amino 
acids of the polypeptides of the invention (or longer amino acid sequences). 
The fragments preferably have SapM activity. Fragments may include 
sequences with one or more amino acids removed, for example, C-terminus 
amino acids in a SapM sequence. 

This invention also includes sequences where conservative substitutions are 
nriade that increase or decrease the activity of SapM. 
Polypeptides comprising one or more d-amino acids are contemplated within 
the invention. Also contemplated are polypeptides where one or more amino 
acids are acetylated at the N-terminus. Those of skill in the art recognize that 
a variety of techniques are available for constructing polypeptide mimetics 
with the same or similar desired SapM activity as the corresponding 
polypeptide compound of the invention but with more favourable activity than 
the polypeptide with respect to solubility, stability, and/or susceptibility to 
hydrolysis and proteolysis. See, for example, Morgan and Gainor, (1989) 
Ann. Rep. Med. Chem., 24:243-252. Examples of polypeptide mimetics are 
described in U.S. Patent No. 5,643,873. Other patents describing how to 
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make and use mimetics include, 5,786,322, 5,767,075, 5,763,571 , 5,753,226, 
5,683,983, 5,677,280. 5,672,584, 5,668.110, 5,654,276. 5,643,873. Mimetics 
of tiie polypeptides of the invention may also be made according to other 
techniques known in the art, for example, by treating a polypeptide of the 
invention with an agent that chemically alters a side group by converting a 
hydrogen group to another group such as a hydroxy or amino group, 
Mimetics preferably include sequences that are either entirely made of amino 
acids or sequences that are hybrids including amino acids and modified 
amino acids or other organic molecules. 

The invention also includes hybrid nucleic acid molecules and polypeptides, 
for example where a sapM DNA sequence from one species is combined with 
a nucleotide sequence from a sequence of plant, mammal, bacteria or yeast 
to encode a fusion polypeptide. The invention includes a fusion protein 
having at least two components, wherein a first component of the fusion 
protein comprises a polypeptide of the invention, preferably a full length SapM 
polypeptide (or a portion thereof, see below). The second component of the 
fusion protein preferably comprises a tag, for example GST, an epitope tag or 
an enzyme. The fusion protein may also comprise a histochemical or 
cytochemical marker such as /acZ, alkaline phosphatase, or horseradish 
peroxidase, or a fluorescent marker such as GFP or one of its derivatives. 
The invention also includes a composition comprising all or part of an isolated 
DNA molecule (preferably sapM [SEQ ID NO:9]. [SEQ ID NO:11], [SEQ ID 
NO:13], [SEQ ID NO;15])of the invention with or without a carrier, preferably 
in a composition for cell transformation. The invention also includes a 
composition comprising an isolated SapM polypeptide (preferably SapM 
[SEQ ID NO:10], [SEQ ID NO:12l, [SEQ ID NO:14], [SEQ ID NO:16] or [SEQ 
ID NO: 22]) with or without a carrier, preferably for studying or modulating 
polypeptide activity. 

Sequence Identity 

The invention includes modified nucleic acid molecules with a sequence 
identity at least about: >17%, >20%, >30%, >40%. >50%, >60%, >70%, >80% 
or >90% more preferably at least about >95%, >99.5%, to the DNA 
sequences provided in [SEQ ID NO: 1] to [SEQ ID NO: 4] or [SEQ ID NO:9], 
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[SEQ ID NO:1 1], [SEQ ID NO:13], [SEQ ID NO:153 (or a partial sequence 
thereof or their complementary sequence). Preferably about 1 , 2, 3, 4, 5, 6 to 
10, 10 to 25, 26 to 50, 51 to 100, or 101 to 250 nucleotides are modified. 
Sequence identity is most preferably assessed by the algorithm of the BLAST 
version 2,1 program advanced search (parameters as above). BLAST is a 
series of programs that are available online at 
http//www.ncbi. nlm.nih.gov/BLAST. 
References to BI-AST searches are: 
References to BLAST searches are: 

Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic 

local alignment search tool." J. Mol. Biol. 215:403-410. 

Gish, W. & States, D.J. (1993) "Identification of protein coding regions by 

database similarity search." Nature Genet. 3:266-272. 

Madden, T.L., Tatusov, R.L. & Zhang, J. (1996) "Applications of network 

BLAST server" Meth. Enzymol. 266:131-141. 

Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W. & 
Lipman, D.J. (1997) "Gapped BLAST and PSI-BLAST: a new generation of 
protein database search programs." Nucleic Acids Res. 25:3389-3402. 
Zhang, J. & Madden, T.L. (1997) "PowerBLAST: A new network BLAST 
application for interactive or automated sequence analysis and annotation." 
Genome Res. 7:649-656. 

The polypeptides encoded by the homologous sapM nucleic acid molecule in 
other species will have amino acid sequence identity at least about: >20%, 
>25%, >28%, >30%, >40% or >50% to an amino acid sequence as provided 
in [SEQ ID NO:10]. [SEQ ID NO:12], [SEQ ID NO:14], [SEQ ID NO:16] (or a 
partial sequence thereof). Some species may have polypeptides with a 
sequence identity of at least about: >60%, >70%, >80% or >90%, more 
preferably at least about: >95%, >99% or >99.5% to all or part of an amino 
acid sequence in as shown in [SEQ ID NO:10], [SEQ ID NO:12], [SEQ ID 
NO: 14], [SEQ ID NO: 16] (or a partial sequence thereof). Identity is calculated 
according to methods known in the art. Sequence identity is most preferably 
assessed by the BLAST version 2.1 program advanced search (parameters 
as above). Preferably about: 1 , 2, 3, 4, 5, 6 to 10, 10 to 25, 26 to 50, 51 to 
100, or 101 to 250 nucleotides or amino acids are modified. 
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The invention includes nucleic acid molecules with mutations that caiise an 
amino acid change in a portion of the polypeptide not involved in providing 
Sapl\/I activity or an amino acid change in a portion of the polypeptide involved 
in providing Sapl\/I activity so that the mutation increases or decreases the 
activity of the polypeptide. 

The sequences of the invention can be prepared according to numerous 
techniques. The invention is not limited to any particular preparation means. 
For example, the nucleic acid molecules of the invention can be produced by 
cDNA cloning, genomic cloning, cDNA synthesis, polymerase chain reaction 
(PGR) or a combination of these approaches (Cunrent Protocols in Molecular 
Biology. P.M. Ausbel et al., 1989). Sequences may be synthesized using 
well-known methods and equipment, such as automated synthesizers. 

Hybridization 

Other functional equivalent forms of the SapM DNA molecules can be isolated 
using conventional DNA-DNA or DNA-RNA hybridization techniques. These 
nucleic acid nrK>lecules and the SapM sequences can be modified without 
significantiy affecting their activity. 

The present invention also includes nucleic acid molecules that hybridize to 
one or more of the DNA sequences provided in [SEQ ID NO:1] to [SEQ ID 
NO:4] and [SEQ ID NO:91, [SEQ ID NO:11], [SEQ ID NO:13l, [SEQ ID 
NO:15] (or a partial sequence thereof or their complementary sequence) and 
that encode peptides or polypeptides exhibiting substantially equivalent 
activity as that of a SapM polypeptide produced by the DNA in [SEQ ID 
NO:10], [SEQ ID NO:12]. [SEQ ID NO:14], [SEQ ID NO:16] Such nucleic acid 
molecules preferably hybridize to all or a portion of sapM or its complement 
under low, moderate (intemnediate), or high stilngency conditions as defined 
herein (see Sambrook et al. (most recent edition) Molecular Cloning: A 
Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor. N.Y.; Ausubel et al. (eds.), 1995, Cun^nt Protocols in Molecular 
Biology, (John Wiley & Sons, NY)). The portion of the hybridizing nucleic 
acids is typically at least 15 (e.g. 20, 25, 30 or 50) nucleotides in length. The 
hybridizing portion of the hybridizing nucleic acid is at least 80% e.g. at least 
95% or at least 98% identical to the sequence or a portion or all of a nucleic 
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acid encoding a SapM polypeptide, or its complement. Hybridizing nucleic 
acids of the type described herein can be used, for example, as a cloning 
probe, a primer (e.g. a PGR primer) or a diagnostic probe. Hybridization of 
the oligonucleotide probe to a nucleic acid sample typically is performed 



as the melting temperature orTm, which is the temperature at which a probe 
dissociates from a target DNA. This melting temperature is used to define the 
required stringency conditions. If sequences are to be identified that are 
related and substantially identical to the probe, rather than identical, then it is 
useful to first establish the lowest temperature at which only homologous 
hybridization occurs with a particular concentration of salt (e.g. SSC or 
SSPE). Then, assuming that 1% mismatching results in a 1 degree Celsius 
decrease in the Tm, the temperature of the final wash in the hybridization 
reaction is reduced accordingly (for example, if sequences having greater 
than 95% identity with the probe are sought, the final wash temperature is 
decreased by 5 degrees Celsius). In practice, the change in Tm can be 
between 0.5 degrees Celsius and 1 .5 degrees Celsius per 1% mismatch. 
Low stringency conditions involve hybridizing at about: 1XSSC, 0.1% SDS at 
50*^C. High stringency conditions are: 0.1XSSC, 0.1 % SDS at 65°C. 
Moderate stringency is about IX SSC, 0.1% SDS at 60 degrees Celsius. The 
parameters of salt concentration and temperature can be varied to achieve 
the optimal level of identity between the probe and the target nucleic acid. 
The present invention also includes nucleic acid molecules from any source, 
whether modified or not, that hybridize to genomic DNA, cDNA, or synthetic 
DNA molecules that encode the amino acid sequence of a SapM polypeptide, 
or genetically degenerate forms, under salt and temperature conditions 
equivalent to those described in this application, and that code for a peptide, 
or polypeptide that has SapM activity. Preferably the polypeptide has the 
same or similar activity as that of a SapM polypeptide. A nucleic acid 
molecule described above is considered to be functionally equivalent to a 
sapM nucleic acid molecule of the present invention if the polypeptide 
encoded by the nucleic acid molecule is recognized in a specific manner by a 
SapM-specific antibody, including, but not restricted to, the antibodies listed in 
this application. 



under stringent conditions. Nucleic acid duplex or hybrid stability is expressed 
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Expression of sapM is induced in A7. bovis BCG grown on ammonium 
cliioride 

Bacterial acid and alkaline phosphatases are typically regulated by 
environmental inorganic phosphate concentration (reviewed in (Rossolini et 
el., 1998)). The expression of these enzymes, which catalyze the hydrolysis of 
exogenous sources of phosphorylated components into inorganic phosphate, 
Is induced under phosphate starvation. This essential limiting nutrient is then 
transported into the cell by membrane permeases. A recent study showed 
that SapM of M. tuberculosis is an acid phosphatase secreted to the culture 
medium (Saleh and Belisle, 2000). To determine if SapM is involved in 
phosphate assimilation, the effect of environmental phosphate concentration 
on SapM expression and activity were examined. Four M. bovis BCG strains, 
organisms closely related to M. tuberculosis, were grown in Sauton medium 
containing three different concentrations of inorganic phosphate, 0.05, 0.5, 
and 5.0 g/L, and the acid phosphatase activity in the culture filtrate protein 
(CFP) fractions was assayed. Under these conditions, acid phosphatase 
activity is low, ranging from 40- to 130 -nmol hr'^ mg"^ (total protein). In 
contrast to most bacterial acid phosphatases, the acid phosphatase activity in 
the CFP fraction of BCG cultures was not regulated by the phosphate 
concentration, and phosphate starvation did not increase its activity (Fig. 1A)- 
Starvation of carbon source in the culture medium also did not affect the 
enzyme activity (Fig. 1A). 

Interestingly, high levels of acid phosphatase activity were detected in BCG 
grown in media containing ammonium chloride (NH4CI) as the nitrogen source 
(Fig. 1 A). The original Sauton medium contains 27 mM of asparagine as the 
primary nitrogen source. Asparagine was substituted by NH4CI at the equal 
molar concentration and the fresh medium was adjusted to pH 7.4. All BCG 
strains tested, BCG-Birkhaug, -Japan, -Frappier and -Pasteur, grow normally 
in the ammonium-containing Sauton medium. The CFP fractions prepared 
under these conditions exhibited much higher levels of acid phosphatase 
activity than those grown in the original Sauton (Fig. 1A). BCG-Birkhaug 
exhibited a dramatic 25-fold increase, from 136- to 3,459- nmol-hr"^-mg"\ 
Other BCG strains exhibited 5- to 10-fold increases of acid phosphatase 
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activity. Adjustment of phosphate or glycerol concentrations in the 
ammonium-containing Sauton media did not change the enzyme activity level 
(data not shown). 

It was previously shown that SapM is the only acid phosphatase detected in 
the extracellular fraction of M. tuberculosis cultures (Saleh and Belisle, 2000). 
To determine if SapM contributes to the observed increase of acid 
phosphatase activity, CFP fractions were analyzed by SDS-PAGE. As shown 
in Fig. 1B, a protein band, visualized by silver staining, was produced by 
BCG-Birl<haug grown in ammonium-containing Sauton, and was not detected 
in culture grown in normal Sauton medium. The mobility of this protein 
corresponds to the molecular weight of the mature SapM protein (--28 kD) and 
to that of partially purified SapM. To confirm the identity of this protein, an 
antibody raised against a synthetic peptide corresponding to amino acid 
residues 201-208 of SapM protein was made and Westem blotting 
performed. Indeed, the 28-kD protein reacted with the anti-SapM antisera, 
indicating that this protein is SapM (Fig. 1B). For other BCG strains, SapM 
protein is less visible, which correlates with the lower level of enzyme activity 
detected in these strains (Fig. 1 A). 
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SapM of M. bows BCG is selectively expressed at mildly acidic pH 

It was not immediately clear why SapM expression would be induced in media 
containing NH4CI. Although our initial hypothesis was that sapM was 
regulated by the availability of nitrogen source, nitrogen starvation did not 
alter the SapM expression or activity (data not shown). However, uptake of 
ammonia (NH4^) is accomplished by a membrane transporter, which may 
couple ammonia uptake to export (Westhoff ef a/., 2002). This would result 
in a gradual acidification of the culture medium. Indeed, measuring the pH of 
the spent BCG culture media revealed that cell growth leads to acidification of 
Sauton medium containing NH4CI (i.e., the pH changed from 7.4 to 5.8 - 6.2), 
whereas the pH of asparagine-containing Sauton medium remains at 7.4 — 
7.8. 

We therefore hypothesized that the acidification of the culture medium is 
responsible for the induction of SapM expression. To test this, BCG-Birkhaug 
was grown in asparagine-containing Sauton medium (pH 7.4), then portions of 
cells were washed and cultured in the same medium except the pH was 
adjusted to 7.0, 6.6, 6.2, and 5.8 with appropriate buffers. Acid phosphatase 
activity from the CFP fractions of these cultures was determined and revealed 
a correlation between the level of acid phosphatase activity and the pH of the 
culture media (Fig. 2A). At pH 7.0 - 7.4, the level of acid phosphatase activity 
is very low. Under mildly acidic conditions (i.e., pH 5.8 - 6.6), a high level of 
acid phosphatase activity was detected and was maximal at pH 6.2 (Fig. 2A). 
SDS-PAGE and Western blotting analyses confirmed that SapM expression is 
induced at pH 5.8 - 6.6 (Fig. 2B), which correlates well with the high levels of 
enzyme activity at the same pH. These results indicate that sapM is 
selectively expressed at mildly acidic pH. The level of SapM activity detected 
in BCG-Birkhaug grown at pH 5.8 - 6.2 is similar to that grown in ammonium- 
containing Sauton medium (compare Figs. 1A & 2A), suggesting that the 
induction of SapM expression by ammonium described above actually reflects 
the effect of pH. 

At pH 7.0 - 7.4, SapM was not detected either by silver staining or Western 
blotting (Fig. 2B), suggesting that SapM level is very low or below detection 
limits, which is consistent with the residual acid phosphatase activity detected 
at these pH values (Fig. 2A). Alternatively, the residual acid phosphatase 
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activity may originate from a small fraction of cell surface-associated acid 
phospliatase that leaks to the external medium during cell growth. Other than 
SapM, only one gene product of tuberculosis, Rv2577, is predicted to be 
an acid phosphatase (Cole a/., 1998). However, Rv2577 does not posses 
an export signal sequence and appears to be cell surface associated 
(Braibant and Content 2001). The activities of cell surface-associated acid 
phosphatase and alkaline phosphatase of BCG were not affected by the 
change of pH in the culture medium (Fig. 2C), suggesting that the pH- 
dependent expression is a specific property of SapM. 

Cloning and expression of M. tuberculosis sapM in Mycobacterium 
smegmatis 

SapM was originally identified by biochemical approaches from jthe 
extracellular medium of M. tuberculosis H37Rv grown in glycerol-alanine-salts 
(GAS) medium (Saleh and Belisle, 2000). Like our ammonium-containing 
Sauton medium, GAS contains NH4CI as a nitrogen source and the spent 
GAS medium is mildly acidic (data not shown). To confimi that the SapM of 
M. tuberculosis shares the same property as that of M. bovis BCG, a DNA 
fragment containing the sapM gene {Rv3310) was cloned into shuttle vector 
pMD31 to generate pSAP (Fig. 3A), which was then transformed into 
Mycobacterium smegmatis mc^-1 55. The sapM gene is absent from the M. 
smegmatis chromosome and SapM activity was.not detected in the 
extracellular CFP fraction (see below). However, when recombinant M. 
smegmatis containing sap/W was grown in Sautoh medium containing NH4CI, 
SapM protein was detected upon SDS-PAGE and Westem blotting analyses 
of the CFP fraction (Fig. SB). SapM was absent from the pSAP strain grown in 
Sauton containing asparagine, and from the control strain containing the 
cloning vector pMDSI . These results indicated that M. tuberculosis sapM 
gene can be expressed and secreted from M. smegmatis. 
To examine if M. tuberculosis sapM is regulated by pH, mc^-155/pSAP was 
grown in Sauton medium (containing asparagine) adjusted to various pH 
levels, as described above. The enzyme activity and the protein concentration 
of SapM correlated with each other, and both were dependent on the pH of 
the culture medium (Figs. 4A - C). High levels of SapM expression and activity 
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were detected at pH 5.8 - 6.6, which is in agreement with results obtained in 
M. bovis BCG. The control strain containing the cloning vector alone showed 
residual acid phosphatase activity, which is likely due to leakage of the cell 
surface-associated acid phosphatase, and is not pH inducible. 

The A7. tuberculosis sapM gene is selectively expressed inside 
macrophages 

Our in vitro studies demonstrate that sapM is selectively expressed at pH 5.8 
- 6.6, a pH level that coincides with mildly acidic mycobacteria-containing 
phagosome (pH 6.1 - 6.5) (Sturgill et al., 1994), which suggests that sapt\/l 
expression is induced within macrophages. To demonstrate that sapM is 
expressed in vivo, two transcriptional fusions were constructed. pSAPM-GFP 
contains 560 bp upstream of the sapM coding region and the sapM gene 
truncated 67 bp from its 3' end. pSAPC-GFP contains the same 560 bp sapM 
promoter fragment, the entire sapM gene and the sapC gene (Rv3311) 
truncated 183 bp from its 3' end (Fig. 5A). The sapM and sapC genes appear 
to be co-transcribed. In both vectors, the truncated genes are transcriptionally 
fused to a promoteriess mutant green fluorescent protein (mGFP) (Barker et 
aL, 1998). These constructs, together with the cloning vector pFPV27, were 
transformed into l\Aycobacterium marinum, a fish pathogen that also produces 
SapM from a chromosomal loci (see below). Recombinant M marinum strains 
grown in asparagine-containing Sauton medium (pH 7.4), in which the sapM 
expression is not induced or at very low levels, were used to infect the murine 
inacrophage-like cell line J774A.1. As predicted, within macrophages, cells of 
M. marinum containing pSAPM-GFP or pSAPC-GFP were brightly 
fluorescent, whereas the extracellular bacteria had minimal fluorescence (Fig. 
5B). A/f. marinum strains containing the cloning vector pFPV27 (Fig. 58) or the 
construct in which sapM promoter was inserted in the opposite orientation 
(data not shown) were nonfluorescent. This result, together with our in vitro 
data described above, indicates that sapM is induced by the mildly acidic 
environment of mycobacterial phagosome of cultured macrophages. 
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The sapM gene is present in pathogenic mycobacteria only 

As described above, SapM protein and enzyme activity were not detected in 
M. smegmatis, a nonpathogenic Mycobacterium species. To examine if SapM 
is specific to pathogenic mycobacteria, we analyzed two other Mycobacterium 
species, the nonpathogenic Mycobacterium chelonae and the fish pathogen, 
M. marinum. Like M. bovis BCG, the CFP of M. marinum culture grown in 
ammonium-containing Sauton medium exhibited a high level of acid 
phosphatase activity (data not shown), and this activity was dependent on the 
pH of the culture media (data not shown). In contrast, M. chelonae was similar 
to M, smegmatis and exhibited a residual level of acid phosphatase activity 
that was not regulated by media pH (data not shown). These results suggest 
that M. marinum contains a SapM homolog, whereas M. chelonae and M. 
smegmatis do not. Searching the unpublished genome sequences available 
at TIGR and the Sanger center revealed that SapM homologs are present in 
M. marinum and Mycobacterium avium (a bird pathogen), but not M. 
smegmatis (Fig. 6A). 

To confirm the above results, DNA hybridizations of chromosomal DNA of M. 
smegmatis and M. ctielonae using a radiolabeled probe specific for M, 
tuberculosis sapMwere performed. Chromosomal DNA of M. bovis BCG and 
the sapM-containing plasmid pSAP were included as positive controls. The 
result showed that the sapM allele was not detected in M. smegmatis and M. 
chelonae chromosomes (Fig. 6B). 

immunbgenicity of SapM 

The antigenicity of SapM in human infection was assessed by Westem blot 
using partially purified M. tuberculosis SapM and sera collected from seven 
individuals. These individuals included one patient diagnosed with lymph node 
TB who began chemotherapy four months prior to collection of the serum; one 
patient who was sputum smear-positive for M. avium; two individuals who had 
been vaccinated with M. bovis BCG; three apparently healthy individuals 
including one involved with patient care and one who tested PPD positive. 
Interestingly, sera from two patients (TB patient and patient infected by M. 
avium) reacted with SapM (lanes 4 and 5, Fig. 7), whereas sera from two 
healthy individuals including the one tested for PPD-positive did not react with 
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SapM (lanes 2 and 6). However, sera from two individuals who had been 
vaccinated with BCG (lanes 7 and 8) and the individual involved in TB patient 
care (lane 3) also reacted with SapM. These results suggest that SapM is 
recognized by antibodies from TB patients, or from individuals who had been 
vaccinated with BCG, but not from healthy individuals who had not been 
exposed to M. tuberculosis. 

Mycobacterium tuberculosis produces a secreted acid phosphatase, SapM, 
which exhibits remarkably high activity towards GTP and NADPH. The 
biological functions of SapM are unknown. In this study, we found that sapM 
is regulated by pH. The expression level and enzyme activity of SapM are 
dramatically increased (up to -30 fold) when the pH of culture medium is 
lowered from 7.0 - 7.4 to 5.8 - 6.6. Transcriptional fusions of truncated sapM 
with the promoterless gfp demonstrated that sapM expression is induced in 
infected macrophages. DNA hybridizations and sequence analyses indicated 
that the sapM gene is present in pathogenic mycobacteria including 
Mycobacterium bovis BCG, Mycobacterium avium, and Mycobacterium 
marinum, biit is absent from nonpathogenic mycobacteria such as 
Mycobacterium smegmatis and Mycobacterium chelonae. Moreover, 
antibodies from TB patients or BCG vaccinated individuals but not from 
healthy individuals recognize SapM. Collectively, these results suggest that 
SapM is important for mycobacterial pathogenesis, which may contribute to 
intracellular survival by interfering with effector molecules involved in 
phagosome maturation. 

Materials and Methods 
Strains and culture conditions 

M. bovis BCG strains, BCG-Japan, -Pasteur, -Frappier, and -Birkhaug, were 
pnDvided by Marcel Behr (McGill University). M. marinum 1218R strain was 
provided by Lucia Barker (Rocky Mountain Laboratories, NIAID). M. 
smegmatis m(^-155 and M. chelonae PS4770 were described previously (Liu 
et al.. 1995; Liu et ai, 1996). 

Mycobacterial cells were routinely grown in nonnal Sauton media containing 
(per Irter): 0.5 g KH2PO4, 0.14 g MgS04. 2.0 g citric acid, 0.05 g ferric 
ammonium citrate, 5.0 g asparagine, and 60 ml glycerol. The pH was adjusted 
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to 7.4. To Study the effect of phosphate concentration on acid phosphatase 
activity, the bacteria were grown in Sauton medium to exponential phase, 
washed with the same medium without KH2PO4 and inoculated in 100 ml of 
Sauton medium in which the phosphate concentration was modified as follows 
(per liter): KH2PO4 was increased by 10 fold to 5.0 g (high Pi medium), or 
reduced by 10 fold to 0.05 g (low Pi medium). For studying the effect of 
nitrogen source on acid phosphatase, asparagine (5 g/l) in the original Sauton 
was substituted with NH4CI (1.42 g/l). For carbon source starvation, glycerol 
was omitted from the Sauton medium. For studying the effect of pH on acid 
phosphatase activity, the pH of the original Sauton medium was adjusted with 
buffers: 20 mM MOPS (for pH 7.0 and 6:6), or 20 mM MES (for pH B.2 and 
5.8). All cultures were grown with continuous shaking at 37**C, except for M. 
marinum and M. chelonae, which were grown at 30**C- 

Molecular cloning 

An ordered BAG library of M. tuberculosis H37Rv genome was used as DNA 
template for cloning. Standard protocols were used for manipulation of DNA. 
Cloning of sap/W was accomplished by ligating a 3 kb Nhe\ fragment of 
BAC403, containing Rv3309 (i/pp), Rv3310 (sapM) and Rv3311 (sapC), into 
the unique Xba\ site of plasmid pMD31 to generate pSAP. Transcriptional 
fusions to a promoteriess mutant gfp were constructed by cloning fragments 
of pSAP into pFPV27. To constmct pSAPC-GFP, a 2.56 kb EcoRV fragment 
of pSAPM, comprising 560 bp upstream of sapM, the complete sapM gene, 
and sapC {Rv3311) truncated 183 bp from its 3' end, was inserted into the ' 
unique EcoRV site of pFPV27 (Bari<er at aL, 1998). To generate pSAPM- 
GFP, vector pSAPC-GFP was cut with Sg/ll and the 3.8 kb and 2.5 kb 
fragments, comprising pFPV27, 560 bp upstream of the sapM start codon and 
all but 67 bp of the sapM coding region, were re-ligated. Plasmids were 
introduced into M. marinum and M. smegmatis mc^-155 by electroporation, 
and recombinants were selected on Middlebrook 7H9 agar (Difco) 
supplemented with 10% OADC enrichment and 25 jag/ml kanamycin. 
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Southern blotting 

Southern blotting was perfonmed according to standard methods. Briefly, 
chromosomal DNA was isolated from M. bovis BCG, M. smegmatis mc^-155, 
and M chelonae, digested with EcoRI or EcoRV, separated on an agarose gel 
and transferred to a nylon membrane (Hybond N+, Amersham Pharmacia). 
Membranes were then hybridized with a^^P-CTP radiolabeled sapM probe (a 
660 bp Hphl-Nde\ fragment of pSAP) and exposed on a phosphoimager 
cassette. 

Enzyme assays 

For assay of secreted acid phosphatase activity, the culture filtrate protein 
(CFP) fractions were collected from cultures grown to exponential phase (Aeoo 
--0.8), concentrated by centrifugal filters (Millipore), and dialyzed (150-200 x 
volumes) against distilled water overnight at4®C. Protein concentrations were 
determined using the BCA assay (Pierce). Acid phosphatase assay was 
carried out using p-nitrophenyiphosphate (pNPP) at pH 6.8 in a microtitre 
format as described previously (Saleh and Belisle, 2000). Specific activity was 
calculated using an extinction coefficient for nitrophenol of 18,380 mlVI hr'* 
cm'\ and the phosphatase activity was expressed as nmol hr^ mg"^ (total CFP 
protein). 

For assay of cell surface-associated acid phosphatase and alkaline 
phosphatase, bacterial cells were collected and briefly sonicated to disrupt 
cell clumps. Cells were then washed in Tris-buffered saline three times 
followed by a final wash in distilled water and ceil density was adjusted to a 
final OD595 of 0.42. The phosphatase activity of whole cells toward pNPP was 
assayed by measuring p-nitrophenol released as described above. Typically, 
50 ^1 of bacterial cells were mixed with 150 \x\ of 25 mM acetate buffer, pH 6.0 
(for acid phosphatase assay) or 25 mM Tris, pH 10.0 (for alkaline 
phosphatase). Reaction buffers also contained 1 mM each of MgCb and 
ZnCl2. The reactions were initiated by the addition of pNPP to a final 
concentration of 1 mg/ml. Following incubation at 37°C for 30 min., cells were 
pelleted by centrifugation and the top 150 iil of the reactions were transferred 
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to ELISA plate wells containing 50 jJ of 1.0 M NaOH, which stopped the 
reaction, and the OD405nm measured in an ELISA reader. 

Generation of anti-SapM antisera 

A peptide, corresponding to residues 201-208 in the SapM amino acid 
sequence, was synthesized using solid phase peptide synthesis. The peptide 
was modified by acetylation of the carboxyl end and by the addition of a 
cysteine at the amino end. Approximately 10 mg of the purified peptide, 
having the sequence CNDGHDGSI-Ac, was reduced by dissolving in 4,0 ml of 
10 mM ammonium bicarbonate buffer. pH 8.0 containing 20 mM DTT (buffer 
A) and Incubating at room temperature for 2 hours. The reduced peptide was 
loaded onto an 1 .5 cm x 4.0 cm anion-exchange column (DEAE-Sephadex 
A50, Pharmacia) pre-equillbrated in buffer A, followed by washing with 25 ml 
of same buffer without DTT. Bound peptide was eluted with 10 ml of 10 mM 
acetate buffer, pH 3.0. The peptide, in a total volume of 2.0 ml, was 
immediately added to a tube containing 2 mg of mcKLH (Pierce Sciences), 
followed by the addition of 200 |al of 1 .0 M phosphate buffer, pH 7.4 and 
mixed continuously at room temperature for 2 hours. The conjugated KLH 
was subsequently desalted on a G-10 column (1 .5 cm x 25 cm) in PBS. Two 
New Zealand rabbits were primed with 250 |ag of the KLH conjugate, followed 
by a first boost at 28 days and a second boost at 56 days (100 ^g each). Sera 
were screened 9 days following the second boost and the antisera collected 
the following day. 

SDS-polyacrylam!de gel electrophoresis and Western blotting 

For detection of SapM protein, CFP samples (typically 8-10 \xg protein) were 
separated on sodium dodecyl sulfate (SDS)-14% polyacrylamide gels, and 
visualized by silver staining, or for Western blotting, transferred to 
nitrocellulose membranes. The membranes were then probed with a 1 :400 
dilution of anti-SapM antisera and developed with anti-rabbit IgG-alkaiine 
phosphatase conjugate and BCIP/NBT. For assay of SapM antigenicity, SapM 
protein was partially purified from the recombinant M. smegmatis strain 
carrying the plasmid pSAP, using methods described previously (Saleh and 
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Belisle. 2000). The protein was run on 14% SDS-PAGE and transfen-ed to 
nitrocellulose membranes. Individual lanes were cut and probed with a 1:150 
dilution of individual human sera and developed as described above. Human 
serum samples were provided by Tony Mazzulli (Mount Sinai Hospital, 
Toronto, ON). 

Fluorescence microscopy 

The mouse macrophage cell line J774 was maintained in RPIVII 1640 media 
containing fetal bovine serum (5 %), 100//g / ml streptomycin, and 100 units / 
ml penicillin. For infections with M. marinum, cells were allowed to adhere to 
glass cover slips (22 x 22 mm) overnight in incubator at 37°C, 5 % CO2- Spent 
media was then replaced with fresh media (without antibiotics) and M. 
marinum was added at MOI of 10:1 and incubated at 37^C for 1 hr. Excess 
bacilli were removed by several washes with sterile PBS and the infected cells 
were finally covered with fresh media and returned to the incubator. At given 
times post Infection, the cover slips were washed several times with PBS, 
fixed for 15 minutes with 3.7% paraformaldehyde, permeabilized for 10 
minutes with 0.1 % Triton X-100, and stained with Texas-red phalloidin for 1 
hr. All treatments were at room temperature. Images were collected on a 
Leica DM IRBE microscope equipped with a Hamamatsu digital camera. 
Images were collected using differential interference contrast (DIG), with a 
FITC filter to visualize GFP, or with a Texas red filter to visualize 
macrophages treated with cytoskeleton-specific stain pholloidin conjugated 
with Texas red. 

The present Invention has been described in detail and with particular 
reference to the preferred embodiments; however, it will be understood by 
one having ordinary skill in the art that changes can be made without 
departing from the spirit and scope thereof. For example, where the 
application refers to proteins, it is clear that peptides and polypeptides may 
often be used. Likewise, where a gene is described in the application, it is 
clear that nucleic acids or gene fragments may often be uised. 

All publications (including Genbank entries), patents and patent applications 
are incorporated by reference in their entirety to the same extent as if each 
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individual publication, patent or paftent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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1 . An isolated DNA sequence comprising a promoter or a promoter 
fragment of a mycobacterial secreted acid pliospliatase gene wlierein the 
promoter or tlie promoter fragment is sufficient to control expression of a 
nucleotide sequence of interest and is inducible under low-pH conditions. 

2. The isolated DNA sequence of claim 1 , wherein the promoter or the 
promoter fragment is selected from the group consisting of a Mycobacterium 
tuberculosis sapM promoter or promoter fragment, a Mycobacterium bovis 
sapM promoter or promoter fragment, a Mycobacterium avium sapM promoter 
or promoter fragment, and a Mycobacterium marinum sapM promoter or 
promoter fragment. 

3. An isolated DNA sequence comprising a promoter or a promoter 
fragment sufficient to control expression of a nucleotide sequence of interest 
and inducible under low-pH conditions, wherein the promoter or the promoter 
fragment hybridize to a sapM promoter selected from the group consisting of 
a Mycobacterium tuberculosis sapM promoter [SEQ ID NO:1], a 
Mycobacterium bovis sapM promoter [SEQ ID NO:2], a Mycobacterium avium 
sapM promoter [SEQ ID NO:3] and a Mycobacterium marinum sapM promoter 
[SEQ ID NO:4] under high stringency hybridization conditions. 

4. An expression vector comprising the isolated DNA sequence of claim 
1.2 or 3. 

5. A host cell transfonned with the vector of claim 4. 

6. A transcription cassette comprising: 

(a) a mycobacterial secreted acid phosphatase promoter or promoter 
fragment, wherein the promoter or the promoter fragment is sufficient to 
control expression of a nucleotide sequence of interest; 

(b) a nucleotide sequence of interest operably linked to the promoter or the 
promoter fragment; and 
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(c) a transcriptional termination region. 

7. The transcription cassette of claim 6 wherein the mycobacteria! 
secreted acid phosphatase promoter or promoter fragment is selected from 
the group consisting of a Mycobacterium tuberculosis sapM promoter [SEQ ID 
NO:1], a Mycobacterium bovis sapM promoter [SEQ ID NO:2], a 
Mycobacterium avium sapM promoter [SEQ ID NO:3], and a Mycobacterium 
marinum sapM promoter [SEQ ID NO:4]. 

8. The transcription cassette of claim 6 or 7 further comprising a 
mycobacterial secreted acid phosphatase N-terminal signal sequence. 

9. The transcription cassette of claim 8 wherein the mycobacteria! 
secreted acid phosphatase N-terminal signal sequence is selected from the 
group consisting of a Mycobacterium tuberculosis sapM N-terminal signal 
sequence [SEQ ID NO:5], a Mycobacterium bovis sapM N-terminal signal 
sequence [SEQ ID NO:6], a Mycobacterium avium sapM N-terminal signal 
sequence [SEQ ID NO:7], and a Mycobacterium marinum sapM N-terminal 
signal sequence [SEQ ID NO:8]. 

10. A method for the diagnosis of a pathogenic mycobacterial infection or a 
pathogenic fungal infection in a subject comprising: 

(a) obtaining a biological sample from a subject; and 

(b) analyzing the sample for the presence of antibodies specific to a 
mycobacterial secreted acid phosphatase wherein detection of antibodies 
specific to the mycobacterial secreted acid phosphatase is indicative of the 
presence of the pathogenic mycobacterial infection or the pathogenic fungal 
infection. 

1 1 . The method of claim 10, wherein the mycobacterial secreted acid 
phosphatase is selected from the group consisting of a Mycobacterium 
tuberculosis SapM [SEQ ID NO:10], a Mycobacterium bovis SapM [SEQ ID 
NO:12], a Mycobacterium avium SapM [SEQ ID NO:14], and a 
Mycobacterium marinum SapM [SEQ ID NO:16]. 
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12. A method for the diagnosis of a pathogenic mycobacterial infection or a 
pathogenic fungal infection in a subject comprising: 

(a) obtaining a nucleic acid sample from a subject; and 

(b) analyzing the sample for the presence of a nucleic acid encoding a 
mycobacterial secreted acid phosphatase wherein detection of the nucleic 
acid encoding the mycobacterial secreted acid phosphatase is indicative of 
the presence of the pathogenic mycobacterial infection or a pathogenic fungal 
Infection. 

13. The method of claim 12 wherein the nucleic acid encoding the 
mycobacterial secreted acid phosphatase is selected from the group 
consisting of a Mycobacterium tuberculosis sapM, a Mycobacterium bovis 
sapM, a Mycobacterium avium sapM, and a Mycobacterium marinum sapM. 

14. A method for the diagnosis of a pathogenic mycobacterial infection or a 
pathogenic fungal infection in a subject comprising: 

(a) obtaining a biological sample from a subject; and 

(b) analyzing the sample for the presence of a mycobacterial secreted acid 
phosphatase activity, wherein detection of the mycobacterial secreted acid 
phosphatase activity is indicative of the presence of the pathogenic 
mycobacterial infection or the pathogenic fungal infection. 

1 5- The method of clainri 14, wherein the mycobacterial secreted acid 
phosphatase activity is selected from the group of mycobacteria consisting of 
a Mycobacterium tuberculosis SapM activity, a Mycobacterium bovis SapM 
activity, a Mycobacterium avium SapM activity, and a Mycobacterium 
marinum SapM activity. 

16. A method of screening for a compound capable of modulating 
production of a mycobacterial secreted acid phosphatase comprising: 
(a) providing a nucleic acid construct comprising a mycobacterial secreted 
acid phosphatase promoter or promoter fragment, wherein the promoter or 
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the promoter fragment is operably linked to a reporter gene capable of 
producing a measurable signal; 

(b) providing a test compound; 

(c) exposing the nucleic acid construct to the test compound; and 

(d) measuring the signal produced by the reporter gene, wherein a change 
in the signal produced in the presence of the test compound as compared to 
in the absence of the test compound indicates that the test compound is 
capable of modulating production of the mycobacterial secreted acid 
phosphatase. 

17. The method of claim 16, wherein the mycobacterial secreted acid 
phosphatase promoter or promoter fragment is selected from the group 
consisting of a Mycobacterium tuberculosis sapM promoter or promoter 
fragment, a Mycobacterium bovis sapM promoter or promoter fragment, a 
Mycobacterium avium sapM promoter or promoter fragment, and a 
Mycobacterium marinum sapM promoter or promoter fragment. 

18. A method of screening for a compound capable of modulating the 
activity of a mycobacterial secreted acid phosphatase comprising: 

(a) incubating a mixture comprising a mycobacterial secreted acid 
phosphatase, a substrate for the mycobacterial secreted acid phosphatase, 
and a test compound; and 

(b) measuring the activity of the mycobacterial secreted acid phosphatase; 
wherein a change in activity in the presence of the test compound as 
compared to in the absence of the test compound indicates that the test 
compound is capable of modulating the activity of the secreted acid 
phosphatase. 

19. The method of claim 18, wherein the mycobacterial secreted acid 
phosphatase is selected from the group consisting of a Mycobacterium 
tuberculosis SapM, a Mycobacterium bovis SapM, a Mycobacterium avium 
SapM, and a Mycobacterium marinum. 
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20- A method of screening for a compound capable of modulating 
secretion of a mycobacterial secreted acid phosphatase comprising: 

(a) exposing mycobacterium cells to a test compound, wherein the 
mycobacterium cells secrete a mycobacterial secreted acid phosphatase and 

(b) measuring the amount of the mycobacterial secreted acid 
phosphatase, secreted by the mycobacterium cells, wherein a change in the 
secretion of the mycobacterial secreted acid phosphatase in the presence of 
the test compound as compared to in the absence of the test compound 
indicates that the test comppund is capable of modulating the secretion of the 
mycobacterial secreted phosphatase. * 

21 . The method of claim 20, wherein the mycobacterial secreted acid 
phosphatase is selected from the group consisting of a Mycobacterium 
tuberculosis SapM, a Mycobacterium bovis SapM, a Mycobacterium avium 
SapM, and a Mycobacterium marinum SapM. 

22. A kit for the detection of a pathogenic mycobacterial disease or 
infection comprising: 

(a) a mycobacterial secreted acid phosphatase; 

(b) at least one antibody specific for the mycobacterial secreted acid 
phosphatase ; and 

(c) one or more reagents necessary for detection of the antibody. 

23. The kit of claim 22, wherein the mycobacterial secreted acid 
phosphatase is selected from the group consisting of Mycobacterium 
tuberculosis SapM, Mycobacterium bovis SapM, Mycobacterium avium SapM, 
and Mycobacterium marir^um SapM. 

24. A kit for the detection of a pathogenic mycobacterial disease or 
infection comprising: 

(a) an oligonucleotide comprising contiguous nucleotides from the nucleic 
acid sequence that is complementary to the sequence of [SEQ ID NO:1] to 
[SEQ ID NO:4], or [SEQ ID NO:9], [SEQ ID NO:11], [SEQ ID NO:13], [SEQ ID 
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NO:15] and capable of specifically hybridizing to the complementary 
nucleotide sequence; and 

(b) reagents for hybridization of the oligonucleotide to a complementary 
nucleic acid sequence. 

25. An antibody which is capable of specifically binding to a mycobacterial 
secreted acid phosphatase or a polypeptide fragment thereof. 

26. The antibody of claim 25, wherein the mycobacterial secreted acid 
phosphatase is selected from the group consisting of Mycobacterium 
tuberculosis SapM [SEQ ID NO:10], Mycobacterium bovis SapM [SEQ ID 
NO:12l, Mycobacterium avium SapM [SEQ ID NO:14], and Mycobacterium 
marinum [SEQ ID NO:16] SapM. 

27. A vaccine or immunogenic composition for treatment or prophylaxis of 
a mammal against challenge by a mycobacterium comprising the antibody of 
claim 26. 

28. A vaccine or immunogenic composition for treatment or prophylaxis of 
a mammal against challenge by a mycobacterium comprising the isolated 
DNA sequence claim 1 ,2 or 3. 

29. A vaccine or immunogenic composition for the treatment or prophylaxis 
of a mammal against challenge by a mycobacterium comprising a 
mycobacterial secreted acid phosphatase selected from the group consisting 
of a Mycobacterium tuberculosis SapM, a Mycobacterium bovis SapM, a 
Mycobacterium avium SapM, and a Mycobacterium marinum SapM or a 
polypeptide fragment thereof. 

30. An antigenic composition useful for the detection of a pathogenic 
mycobacterial disease or infection in a subject comprising a SapM 
polypeptide and substantially free from other proteins or glycoproteins with 
which it is natively admixed in a culture of pathogenic mycobacteria. 
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SEQUENCE LISTING 

<110> Saleh, Mazen T. 

Alexander, David C. 
Liu, Jun 

<120> Secreted Acid Phosphatase (sapM) is Present Only in Pathogenic 
Mycobacteria and Esqpressed Selectively at Phagosomal pH 

<130> 4146 0005 

<140> 

<14l> 

<150> US 60/416,957 

<151> 2002-10-09 

<160> 22 

<170> Patentin version 3.0 

<210> 1 

<211> 500 

<212> DNA 

<213> Mycobacterium tuberculosis 



<400> 1 

catcgggtca agcaccatga ccggtacatc cgtcaggtcg tcgggcagcg agtccagata 60 

cggcaccggc tggtgggttt gctcgtcgcg ggcgacaccg acaaagccaa cg-t'gcgcctc 12 0 

cggcaaggcg gcatgcgcct cgtcgaccat ccccaacccc gcccgcaaca caggaa(|aag 180 

caggggtggc ttggttagcc gcgacccgac cgtctcggcc agcggcgtac ggatcgggac 240 

tggctcgcag ggcgcatcgc gggtggcctc atagatcaac agcagcgtga gctcgcgcag 300 

cgctgcccgg aagccggcgt tgtcggtgcg ttcgtcacgc agcgtggtca gtcgggccgc 360 

ggccagtggg tggtcaacga catggacctg cacggcgttg aaccctatat aacaatcgtg 420 

gctcggtccc ctaaaagggg gctgatacgg gtgcgtccat ccgcgcgacc ggtcaacccc 480 

gtccatatac tcccggcatg 500 

<210> 2 
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<211> 500 
' <212> DMA 
<213> Mycobacterium bovis 



<400> 2 

catcgggtca agcaccatga ccggtadatc cgtcaggtcg tcgggcagcg agtccagata 60 

cggcaccggc tggtgggttt gctcgtcgcg ggcgacaccg acaaagccaa cgtgcgcctc 120 

cggcaaggcg gcatgcgcct cgbcgaccat ccccaacccc gcccgcaaca caggaaccag 180 

caggggtggc ttggttagcc gcgacccgac cgtctcggcc agcggcgtac ggatcgggac 240 

tggctcgcag ggcgcatcgc gggtggcctc atagatcaac agcagcgtga gctcgcgcag 300 

cgctgcccgg aagccggcgt tgtcggtgcg ttcgtcacgc agcgtggtca gtcgggccgc 360 

ggccagtggg tggtcaacga catggacctg cacggcgttg aaccctatat aacaatcgtg 420 

gctcggtccc ctaaaagggg gctgatacgg gtgcgtccat ccgcgcgacc ggtcaacccc 480 

gtccatatac tcccggcatg 500 

<210> 3 

<211> 500 

<212> DNA 

<213> Mycobacterium avium 



<400> 3 

gccggtggcc agcatcgggt ccagcaccat caccgacctg ccggccagct tgtcgggcag 60 

cgccgccaaa tacgggaccg gccggtgggt ttgctcgtcg cgggcgacgc cgacgaagcc 120 

cacctcggcc tcgggcagcg cggcctgcgc cgggtcgacc atgcccagcc cggcgcgcag 180 

caccggaacc agcagcggcg gattgaccag tcgggtcccg gcggccgcgg ccaccggggt 240 

gcggatccgg accgacttgc gcggcgcgtc gcggctggcc tcgtagacca gcatcagcgt 300 

cagatcgcgc agcgcggccc gaaatccggc ggtgtcggtg cgttcgtcgc gcagcaccgt 360 

cagccgggcc- gcggccaacg ggtggtcgat cacgcacacg tccatctggt cgagggtata 420 

taacgatcgg gcaaagcccc gctgacacgc ttgcccgccg gccggaaacg ccttaccgcc 480 

gttcgtatac tccgggcgtg 500 

<210> 4 

<211> 500 

<212> DNA 

<213> Mycobacterium marinum 
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PCT/CA2003/001554 



<400> 4 
cgccggtggc 


cagcatcggg 


tccagcacca 


ttacgggtag 


cccggacaag tcgtcgggca 


60 


gcgattcaag 


atatgggacg 


ggctggtggg 


tctgctcatt 


gcgggcgata ccgacaaagc 


120 


cgacccgcgc 


ctccggcagc 


gccgcatgcg 


cttcgtcgac 


catgcccagt ccggcgcgca 


180 


gcaccggaac 


cagaagcggg 


gggttggcca gtcttaggcc 


tgtcgtggcc gcaagcggtg 


240 


tacggatagc 


gacggattcg 


gtgggcgctg 


cgcgggtcgc 


ctcatagacc agtaccagtg 


300 


tcagctcacg 


caaggccbtg 


cggaaagcag 


cgtttccggt gcgttcgtca cgcagcgcgg 


360 


tcaggcgggc 


cgccgccagc 


gggtgatcaa 


tgacgtggac 


ttccacatgg gtgaccctat 


420 


ataacaatcg 


gattcaagcc 


gctgacacgc 


tccccctcct 


cgcggcgccg aggccgagcc 


- 480 


gcccatatac 


tccgggcgtg 








500 



<210> S 

<211> 43 

<212> PRT 

<213> Mycobacterium tuberculosis 



<400> 5 

Met Leu Arg Gly lie Gin Ala Leu Ser Arg Pro Leu Thr Arg Val Tyr 
15 10 15 

Arg Ala Leu Ala Val lie Gly Val Leu Ala Ala Ser Leu Leu Ala Ser 
20 25 30 

Trp Val Gly Ala Val Pro Gin Val Gly Leu Ala 
35 40 

<210> 6 
<211> 43 
<212> PRT 

<213> Mycobacterium bovis 
<400> 6 

Met Leu Arg Gly lie Gin Ala Leu Ser Arg Pro Leu Thr Arg Val Tyr 
15 10 15 

Arg Ala Leu Ala Val lie Gly Val Leu Ala Ala Ser Leu Leu Ala Ser 
20 25 30 

Trp Val Gly Ala Val Pro Gin Val Gly Leu Ala 
35 40 

<210> 7 
<211> 42 
<212> PRT 
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<213> Mycobacterixim avium 

<400> 7 

Met Ser Arg ciu Asn Arg Ser Arg Arg Arg Leu He Gly Gly Ala Tyr 
15 10 15 

Arg Ser Leu Arg Leu Leu Gly Ala Val Ala Ala Val Ala Leu Ala Ala 
20 25 30 * 

Ser Pro Leu Thr Pro Arg Thr Ser Leu Ala 





35 40 


<210> 


8 


<211> 


40 


<212> 


PRT 


<213> 


Mycobacterium marinum 



<400> 8 

Met Cy& Gly Leu Lys Gin Arg Phe Thr Ser Thr Phe Arg Ala Leu Ala 
15 10 15 

Val Leu Gly Ala Val Ala Val Ser Leu Pro Ala His Gly Ser Asp Ala 
20 25 30 

Pro Pro Arg He Asp Leu Thr Ala 
35 40 

<210> 9 
<211> 900 
<212> DNA 

<213> Mycobacterium tuberculosis 



<220> 

<221> CDS 

<222> (I).. (900) 



<400> 9 

atg etc cgc gga ate cag get etc age egg ecc ctg acc agg gta tac 48 
Met Leu Arg Gly He Gin Ala Leu Ser Arg Pro Leu Thr Arg Val Tyr 
1 5 10 15 

egt gee ttg geg gtg ate ggt gtc ctg gca gca teg ttg ctg gcc tea 96 
Arg Ala Leu Ala Val He Gly Val Leu Ala Ala Ser Leu Leu Ala Ser 
20 25 30 

tgg gtc gge get gtc cea caa gtg ggt ctg gca geg agt gee ctg ceg 144 
Trp Val Gly Ala Val Pro Gin Val Gly Leu Ala Ala Ser Ala Leu Pro 
35 40 45 

acc tte geg cac gtg gtc ate gtg gtg gag gag aac cgc teg cag gee 192 
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Thr Phe Ala His VaX Val lie Val Val Glu Glu Asn Arg Ser Gin Ala 
50 55 60 



gcc ate ate ggt aac aag teg get ccc ttc ate aat teg etg gee gee 
Ala lie lie Gly Asn Lys Ser Ala Pro Phe He Asn Ser Leu Ala Ala 
65 70 75 80 



gcg gtc ggc tec acg gtg tge agt gcg gge aaa tae gca cge aaa cac 
Ala Val Gly Ser Thr Val Cys Ser Ala Gly Lys Tyr Ala Arg Lys His 
145 150 155 160 



240 



aac ggc gcg atg atg gee cag gcg ttc gee gaa aca cac ceg age gaa 288 
Asn Gly Ala Met Met Ala Gin Ala Phe Ala Glu Thr His Pro Ser Glu 
85 90 95 

ceg aac tae etg gca etg ttc get gge aac aca ttc ggg ttg acg aag 336 
Pro Asn Tyr Leu Ala Leu Phe Ala Gly Asn Thr Phe Gly Leu Thr Lys 
100 105 110 

aac acc tgc ccc gtc aac ggc ggc gcg etg ccc aac etg ggt tct gag 384 
Asn Thr Cys Pro Val Asn Gly Gly Ala Leu Pro Asn Leu Gly Ser Glu 
115 120 125 

ttg etc age gcc ggt tac aca ttc atg ggg ttc gcc gaa gac ttg cct 432 
Leu Leu Ser Ala Gly Tyr Thr Phe Met Gly Phe Ala Glu Asp Leu Pro 
130 135 140 



480 



gtg ceg tgg gtc aac ttc agt aac gtg ceg acg aca etg teg gtg ceg 528 
Val Pro Trp Val Asn Phe Ser Asn Val Pro Thr Thr Leu Ser Val Pro 
165 170 175 

ttt teg gca ttt ceg aag ceg cag aat tac ccc ggc etg ceg acg gtg 576 
Phe Ser Ala Phe Pro Lys Pro Gin Asn Tyr Pro Gly Leu Pro Thr Val 
180 185 190 

teg ttt gtc ate cct aac gcc gac aac gac atg cac gac ggc teg ate 624 
Ser Phe Val He Pro Asn Ala Asp Asn Asp Met His Asp Gly Ser He 
195 200 205 

gee caa ggc gac gee tgg etg aac egc cac etg teg gca tat gee aac 672 
Ala Gin Gly Asp Ala Trp Leu Asn Arg His Leu Ser Ala Tyr Ala Asn 
210 215 220 

tgg gee aag aca aac aac age etg etc gtt gtg acc tgg gac gaa gac 720 
Trp Ala Lys Thr Asn Asn Ser Leu Leu Val Val Thr Trp Asp Glu Asp 
225 230 235 240 

gac ggc age age cge aat cag ate ceg acg gtg ttc tac ggc gcg cac 768 
Asp Gly Ser Ser Arg Asn Gin He Pro Thr Val Phe Tyr Gly Ala His 
245 250 255 

gtg egg ccc gga act tac aac gag acc ate age cac tac aac gtg etg 816 
Val Arg Pro Gly Thr Tyr Asn Glu Thr He Ser His Tyr Asn Val Leu 
260 265 270 

tec aea ttg gag cag ate tae gga etg ccc aag acg ggt tat gcg acc 864 
Ser Thr Leu Glu Gin He Tyr Gly Leu Pro Lys Thr Gly Tyr Ala Thr 
275 280 285 

aat get ceg cea ata acc gat att tgg ggc gac tag '900 
Asn Ala Pro Pro He Thr Asp He Trp Gly Asp 
290 295 



<210> 10 
<211> 299 
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<212> PRT 

c213> Mycobacterium tuberculosis 
<400> 10 

Met Leu Arg Gly lie Gin Ala Leu Ser Arg Pro Leu Thr Arg Val Tyr 
15 10 15 

Arg Ala Leu Ala Val lie Gly Val Leu Ala Ala Ser Leu Leu Ala Ser 
20 25 30 

Trp Val Gly Ala Val Pro Gin Val Gly Leu Ala Ala Ser Ala Leu Pro 
35 40 45 

Thr Phe Ala His Val Val lie Val Val Glu Glu Asn Arg Ser Gin Ala 
50 55 60 

Ala lie He Gly Asn Lys Ser Ala Pro Phe He Asn Ser Leu Ala Ala 
65 70 75 80 

Asn Gly Ala Met Met Ala Gin Ala Phe Ala Glu Thr His Pro Ser Glu 
85 90 95 

Pro Asn Tyr Leu Ala Leu Phe Ala Gly Asn Thr Phe Gly Leu Thr Lys 
100 105 110 

Asn Thr Cys Pro Val Asn Gly Gly Ala Leu Pro Asn Leu Gly Ser Glu 
115 120 125 

Leu Leu Ser Ala Gly Tyr Thr Phe Met Gly Phe Ala Glu Asp Leu Pro 
130 135 140 

Ala Val Gly Ser Thr Val Cys Ser Ala Gly Lys Tyr Ala Arg Lys His 
145 150 155 160 

Val Pro Trp Val Asn Phe Ser Asn Val Pro Thr Thr Leu Ser Val Pro 
165 170 175 

Phe Ser Ala Phe Pro Lys Pro Gin Asn Tyr Pro Gly Leu Pro Thr Val 
180 185 190 

Ser Phe Val He Pro Asn Ala Asp Asn Asp Met His Asp Gly Ser He 
195 200 205 

Ala Gin Gly Asp Ala Trp Leu Asn Arg His Leu Ser Ala Tyr Ala Asn 
210 215 220 

Trp Ala Lys Thr Asn Asn Ser Leu Leu Val Val Thr Trp Asp Glu Asp 
225 230 235 240 
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Asp Gly Ser Ser Arg Asn Gin lie Pro Thr Val Phe Tyr Gly Ala His 
245 250 255 

Val Arg Pro Gly Thr Tyr Asn Glu Thr He Ser His Tyr Asn Val Leu 
260 265 270 

Ser Thr Leu Glu Gin He Tyr Gly Leu Pro Lys Thr Gly Tyr Ala Thr 
275 280 285 

Asn Ala Pro Pro He Thr Asp He Trp Gly Asp 
290 295 

<210> 11 

<211> 900 

<212> DNA 

<213> Mycobacterium bovis 



<220> 

<221> C!DS 

<222> (1)..(900) 



<400> 11 

atg etc cgc gga ate cag get etc age egg ccc etg aee agg gta tac 48 
Met Leu Arg Gly He Gin Ala Leu Ser Arg Pro Leu Thr Arg Val Tyr 
15 10 15 

cgt gee ttg gcg gtg ate ggt gte ctg gca gca teg ttg etg gee tea 96 
Arg Ala Leu Ala Val He Gly Val Leu Ala Ala Ser Leu Leu Ala Ser 
20 25 30 

tgg gtc ggc get gtc cca caa gtg ggt ctg gca gcg agt gcc ctg cog 144 
Trp Val Gly Ala Val Pro Gin Val Gly Leu Ala Ala Ser Ala Leu Pro 
35 40 45 

acc ttc gcg cac gtg gtc ate gtg gtg gag gag aac cgc teg cag gcc 192 
Thr Phe Ala His Val Val He Val Val Glu Glu Asn Arg Ser Gin Ala 
50 55 60 

gcc ate ate ggt aac aag teg get "ccc ttc ate aat teg etg gcc gcc 240 
. Ala He He Gly Asn Lys Ser Ala Pro Phe He Asn Ser Leu Ala Ala 
65 70 75 80 

aac ggc gcg atg atg gcc cag gcg ttc gee gaa aca cac ccg age gaa 288 
Asn Gly Ala Met Met Ala Gin Ala Phe Ala Glu Thr His Pro Ser Glu 
85 90 95 

ccg aac tac ctg gca ctg ttc get ggc aac aca ttc ggg ttg acg aag 336 
Pro Asn Tyr Leu Ala Leu Phe Ala Gly Asn Thr Phe Gly Leu Thr Lys 
100 105 110 

aac acc tgc ccc gtc aac ggc ggc gcg ctg ccc aac ctg ggt tet gag 384 
Asn Thr Cys Pro Val Asn Gly Gly Ala Leu Pro T^n Leu Gly Ser Glu 
115 120 125 

ttg etc age gcc ggt tac aca ttc atg ggg ttc gee gaa gae ttg cet 432 
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Leu Leu Ser Ala Gly Tyr Thr Phe Met Gly Phe Ala Glu Asp Leu Pro 
130 135. 140 

gcg gtc ggc tec acg gtg tgc agt gcg ggc aaa tac gca cgc aaa cac 480 
Ala Val Gly Ser Thr Val Cys Ser Ala Gly Lys Tyr Ala Arg Lys His 
145 150 155 160 

gtg ccg tgg gtc aac ttc agt aac gtg ccg gcg aca ctg teg gtg ccg 528 
Val Pro Trp Val Asn Phe Ser Asn Val Pro Ala Thr Leu Ser Val Pro 
165 170 175 

ttt teg gca ttt ccg aag ccg cag aat tac ccc ggc ctg ccg acg gtg 576 
Phe Ser Ala Phe Pro Lys Pro Gin Asn Tyr Pro Gly Leu Pro Thr Val 
180 185 190 

teg ttt gtc ate cct aac gee gac aac gac atg cac gac ggc teg ate 624 
Ser Phe Val lie Pro Asn Ala Asp Asn Asp Met His Asp Gly Ser lie 
195 200 205 

gee caa ggc gac gee tgg ctg aac cgc cac ctg teg gca tat gee aac 672 
Ala Gin Gly Asp Ala Trp Leu Asn Arg His Leu Ser Ala Tyr Ala Asn 
210 215 220 

tgg gcc aag aca aac aac age ctg etc gtt gtg acc tgg gac gaa gac 720 
Trp Ala Lys Thr Asn Asn Ser Leu Leu Val Val Thr Trp Asp Glu Asp 
225 230 235 240 

gac ggc age age cgc aat cag ate ccg acg gtg ttc tac ggc gcg cac 768 
Asp Gly Ser Ser Arg Asn Gin lie Pro Thr Val Phe Tyr Gly Ala His 
245 250 255 

gtg egg ccc gga act tac aac gag acc ate age cac tac aac gtg ctg 816 
Val Arg Pro Gly Thr Tyr Asn Glu Thr He Ser His Tyr Asn Val Leu 
260 265 270 

tec aca ttg gag cag ate tac gga ctg ccc aag acg ggt tat gcg acc 864 
Ser Thr Leu Glu Gin He Tyr Gly Leu Pro Lys Thr Gly Tyr Ala Thr 
275 280 285 

aat get ccg cea ata acc gat att tgg ggc gac tag 900 
Asn Ala Pro Pro He Thr Asp He Trp Gly Asp 
290 295 

<210> 12 

<211> 299 ... 

<212> PRT 

<213> Mycobacterium bovis 



<400> 12 

Met Leu Arg Gly He Gin Ala Leu Ser Arg Pro Leu Thr Arg Val Tyr 
15 10 15 



Arg Ala Leu Ala Val He Gly Val Leu Ala Ala Ser Leu Leu Ala Ser 
20 25 30 



Trp Val. Gly Ala Val Pro Gin Val Gly Leu Ala Ala Ser Ala Leu Pro 
35 40 45 
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Thr Phe Ala His Val Val He Val Val Glu Glu Asn Arg Ser Gin Ala 
50 55 60 

Ala He He Gly Asn Lys Ser Ala Pro Phe He Asn Ser Leu Ala Ala 
65 70 75 80 

Asn Gly Ala Met Met Ala Gin Ala Plie Ala Glu Thr His Pro Ser Glu 
85 90 95 

Pro Asn Tyr Leu Ala Leu Phe Ala Gly Asn Thr Phe Gly Leu Thr Lys 
100 105 110 

Asn Thr Cys Pro Val Asn Gly Gly Ala Leu Pro Asn Leu Gly Ser Glu 
115 . 120 125 

Leu Leu Ser Ala Gly Tyr Thr Phe Met Gly Phe Ala Glu Asp Leu Pro 
130 135 140 

Ala Val Gly Ser Thr Val Cys Ser Ala Gly Lys Tyr Ala Arg Lys His 
145 150 155 160 

Val Pro Trp Val Asn Phe Ser Asn Val Pro Ala Thr Leu Ser Val Pro 
165 170 175 

Phe Ser Ala Phe Pro Lys Pro Gin Asn Tyr Pro Gly Leu Pro Thr Val 
180 185 190 

Ser Phe Val He Pro Asn Ala Asp Asn Asp Met His Asp Gly Ser He 
195 200 205 

Ala Gin Gly Asp Ala Trp Leu Asn Arg His Leu Ser Ala Tyr Ala Asn 
210 215 220 

Trp Ala Lys Thr Asn Asn Ser Leu Leu Val Val Thr Trp Asp Glu Asp 
225 230 235 240 

Asp Gly Ser Ser Arg Asn Gin He Pro Thr Val Phe Tyr Gly Ala His 
245 250 255 

Val Arg Pro Gly Thr Tyr Asn Glu Thr He Ser His Tyr Asn Val Leu 
260 265 270 

Ser Thr Leu Glu Gin He Tyr Gly Leu Pro Lys Thr Gly Tyr Ala Thr 
275 280 285 

Asn Ala Pro Pro He Thr Asp He Trp Gly Asp 
290 295 

<210> 13 
<211> 903 
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<212> DNA 

<213> Mycobacterium avium 
<220> 

<221> CDS 

<222> (1)..(903) 



<400> 13 

gtg teg cgc gaa aat cga agt cgc aga agg ctg ate ggc ggc gca tac 48 
Met Ser Arg Glu Asn Arg Ser Arg Arg Arg Leu lie Gly Gly Ala Tyr 
15 10 15 

cga age ctg egg ctg etc ggc gee gtg gee gcg gtg geg ctg gcg gcc 96 
Arg Ser Leu Arg Leu Leu Gly Ala Val Ala Ala Val Ala Leu Ala Ala 
20 25 30 

age ccg ttg aca ccg cgc acc age ctt gcg gca gcg gee att ccg caa 144 
Ser Pro Leu Thr Pro Arg Thr Ser Leu Ala Ala Ala Ala lie Pro Gin 
35 40 45 

ccg teg cac ate gtg ate gtg gtg gag gaa aac cgt tec gag age ggc 192 
Pro Ser His lie Val lie Val Val Glu Glu Asn Arg Ser Glu Ser Gly 
50 55 60 

ate ate ggc aac aag teg gcg cec ttc ate ace gcg ctg gcc gcg tec 240 
lie He Gly Asn Lys Ser Ala Pro Phe He Thr Ala Leu Ala Ala Ser 
65 70 75 80 

ggc gcc aac atg acc cag teg ttc gcc gaa acc cac cec age gag ecc 288 
Gly Ala Asn Met Thr Gin Ser Phe Ala Glu Thr His Pro Ser Glu Pro 
85 90 95 

aat tac ctg gcg ctg ttc gcc ggc aac acg ttc ggg gtg acc aag gac 336 
Asn Tyr Leu Ala Leu Phe Ala Gly Asn Thr Phe Gly Val Thr Lys Asp 
100 105 110 

ctg tgc ccg gtc aac gcc ggc gcc gca ecc aac ctg ggg tec gaa ttg 384 
Leu Cys Pro Val Asn Ala Gly Ala Ala Pro Asn Leu Gly Ser Glu Leu 
115 120 125 

etc gcc gee ggt tac aca ttc gee ggc tac gee gag ggc ctg ccg tec 432 
Leu Ala Ala Gly Tyr Thr Phe Ala Gly Tyr Ala Glu Gly Leu Pro Ser 
130 135 140 

ccg ggc tea ccg gtg tgc agt geg ggc aag tac geg cga aaa eat gtg 480 
Pro Gly Ser Pro Val Cys Ser Ala Gly Lys Tyr Ala Arg Lys His Val 
145 150 155 160 

ccg tgg gcc aac ttc acc aac gtg ccg gcg gcg age teg ctg ccg ttc 528 
Pro Trp Ala Asn Phe Thr Asn Val Pro Ala Ala Ser Ser Leu Pro Phe 
165 170 175 

teg gcg ttc ccg atg ggc aac tac gee age ctg ccg acg gtg teg ttc 576 
Ser Ala Phe Pro Met Gly Asn Tyr Ala Ser Leu Pro Thr Val Ser Phe 
180 185 190 

gtc ate ccg aac aac gac aac aac atg cac gac ggc teg ate geg cag 624 
Val He Pro Asn Asn Asp Asn Asn Met His Asp Gly Ser He Ala Gin 
195 200 205 

gcc gac gee tgg ctg aac egg cag ctg tec ggc tac gee aat tgg gcg 672 
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Ala Asp Ala Trp Leu Asn Arg Gin Leu Ser Gly Tyr Ala Asn Trp Ala 

210 215 220 

ctg gcc aac aac age ctg ctg ate gtc acc ttc gac gag gac gac aac 720 
Leu Ala Asn Asn Ser Leu Leu lie Val Thr Phe Asp Glu Asp Asp Asn 
225 230 235 240 

tec aac gtc gga gcc age egc aac cag ate cec acg gtg ttc tac ggc 768 
Ser Asn Val Gly Ala Ser Arg Asn Gin lie Pro Thr Val Phe Tyr Gly 
245 250 255 

gcc cac gtc cgc ccc ggc aac tac gcc gag cag ate aac cac tac aac 816 
Ala His Val Arg Pro Gly Asn Tyr Ala Glu Gin lie Asn His Tyr Asn 
260 265 270 

gtg ctt gcc acc etc gag cag atg tac ggg ctg ccc aag acg ggc tat 864 
Val Leu Ala Thr Leu Glu Gin Met Tyr Gly Leu Pro Lys Thr Gly Tyr 
275 280 285 

gcc gcc ggc gcc gcc ccc ate acc gac ate tgg ggc tga 903 
Ala Ala Gly Ala Ala Pro lie Thr Asp lie Trp Gly 
290 295 300 

<210> 14 

<211> 300 

<212> PRT 

<213> Mycobacterium avium 



<400> 14 

Met Ser Arg Glu Asn Arg Ser Arg Arg Arg Leu lie Gly Gly Ala Tyr 
15 10 15 



Arg Ser Leu Arg Leu Leu Gly Ala Val Ala Ala Val Ala Leu Ala Ala 
20 25 30 



Ser Pro Leu Thr Pro Arg Thr Ser Leu Ala Ala Ala Ala lie Pro Gin 
35 40 45 



Pro Ser His He Val He Val Val Glu Glu Asn Arg Ser Glu Ser Gly 
50 55 60 



He He Gly Asn Lys Ser Ala Pro Phe He Thr Ala Leu Ala Ala Ser 
65 70 75 80 



Gly Ala Asn Met Thr Gin Ser Phe Ala Glu Thr His Pro Ser Glu Pro 
85 90 95 



Asn Tyr Leu Ala Leu Phe Ala Gly Asn Thr Phe Gly Val Thr Lys Asp 
100 105 110 



Leu Cys Pro Val Asn Ala Gly Ala Ala Pro Asn Leu Gly Ser Glu Leu 
115 120 125 



11 



wo 2004/033677 PCT/CA2003/001554 

Leu Ala Ala Gly Tyr Thr Phe Ala Gly Tyr Ala Glu Gly Leu Pro Ser 
130 135 140 

Pro Gly Ser Pro Val Cys Ser Ala Gly Lys Tyr Ala Arg Lys His Val 
145 150 155 * 160 

Pro Trp Ala Asn Phe Thr Asn Val Pro Ala Ala Ser Ser Leu Pro Phe 
165 170 175 

Ser Ala Phe Pro Met Gly Asn Tyr Ala Ser Leu Pro Thr Val Ser Phe 
180 185 190 

Val lie Pro Asn Asn Asp Asn Asn Met His Asp Gly Ser lie Ala Gin 
195 200 205 

Ala Asp Ala Trp Leu Asn Arg Gin Leu Ser Gly Tyr Ala Asn Trp Ala 
210 215 220 

Leu Ala Asn Asn Ser Leu Leu lie Val Thr Phe Asp Glu Asp Asp Asn 
225 230 235 240 

Ser Asn Val Gly Ala Ser Arg Asn Gin lie Pro Thr Val Phe Tyr Gly 
245 250 255 

Ala His Val Arg Pro Gly Asn Tyr Ala Glu Gin lie Asn His Tyr Asn 
260 265 270 

Val Leu Ala Thr Leu Glu Gin Met Tyr Gly Leu Pro Lys Thr Gly Tyr 
275 280 285 

Ala Ala Gly Ala Ala Pro lie Thr Asp lie Trp Gly 
290 295 300 

<210> 15 

<211> . 888 . ... ... 

<212> DNA 

<213> Mycobacterium marinum 



<220> 

<221> C!DS 

<222> (1)..(888) 

<400> 15 

gtg tgt ggc ctg aaa cag cgt ttt acc agt aca ttt cga get ctg gcg 48 
Met Cys Gly Leu Lys Gin Arg Phe Thr Ser Thr Phe Arg Ala Leu Ala 
15 10 15 

gta etc ggc gcg gtg gcg gta tec eta ccg gcc cac ggt age gae get 96 
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Val Leu Gly Ala Val Ala Val Ser Leu Pro Ala His Gly Ser Asp Ala 
20 25 30 

ccc ccg cgt ate gac ctg acc gcc act gcg ttg ccg gcg ttc tea cat 144 
Pro Pro Arg lie Asp lieu Thr Ala Thr Ala Leu Pro Ala Phe Ser His 
35 • 40 45 

gtg gtg gtc gtg gtg gag gag aac cat teg cag gcc aac ate att ggc 192 
Val Val Val Val Val Glu Glu Asn His Ser Gin Ala Asn lie lie Gly 
50 55 60 

aac aag gcg gcc ccg ttc ate aat gcg ctg gcc gee aat ggc gcg atg 240 
Asn Lys Ala Ala Pro Phe He Asn Ala Leu Ala Ala Asn Gly Ala Met 
65 70 75 80 

atg teg cag teg ttc gee gaa acg eae eee age gaa ecc aae tae ctg 288 
Met Ser Gin Ser Phe Ala Glu Thr His Pro Ser Glu Pro Asn Tyr Leu 
85 90 95 

gcc ttg ttc gcc ggt ace ace ttc ggc ttg aag aag aac acg tgt ccg 336 
Ala Leu Phe Ala Gly Thr Thr Phe Gly Leu Lys Lys Asn Thr Cys Pro 
100 105 110 

gtc aat gcg ggc age acg ccc aac ctg get teg gag ttg etc gcc gcg 384 
Val Asn Ala Gly Ser Thr Pro Asn Leu Ala Ser Glu Leu Leu Ala Ala 
115 * 120 125 

ggc eae acg ttc gta ggt ttc gcc gag age ctg ccc gaa gtc ggt teg 432 
Gly His Thr Phe Val Gly Phe Ala Glu Ser Leu Pro Glu Val Gly Ser 
130 135 140 

aeg gtc tgc age gee gga aag tae ggg cge aag eat gcg cet tgg gtg 480 
Thr Val Cys Ser Ala Gly Lys Tyr Gly Arg Lys His Ala Pro Trp Val 
145 150 155 160 

aac ttc age aat gtt ccg gcc acg ctg teg atg eee ttc tec gcg ttt 528 
Asn Phe Ser Asn Val Pro Ala Thr Leu Ser Met Pro Phe Ser Ala Phe 
165 170 175 

ccg acg ccg gcg gac tac gcc agg ctg ccc acg gtg tec ttc gtc ate 576 
Pro Thr Pro Ala Asp Tyr Ala Arg Leu Pro Thr Val Ser Phe Val He 
180 185 190 

ccc aac ggg gat aac aac atg cac gac ggc acc ate gcg gca get gac 624 
Pro Asn Gly Asp Asn Asn Met His Asp Gly Thr He Ala Ala Ala Asp 
195 200 205 

gag tgg ttg aac cgt caa ctg teg ccg tae gee aae tgg gee ega tec 672 
Glu Trp Leu Asn Arg Gin Leu Ser Pro Tyr Ala Asn Trp Ala Arg Ser 
210 215 220 

aae aae age ctg ctg ate gtg aeg tgg gat gag gac gac ggc ggc age 720 
Asn Asn Ser Leu Leu He Val Thr Trp Asp Glu Asp Asp Gly Gly Ser 
225 230 235 240 

cge aae cag att ccc acg gtg ttc tac ggc gca cac gta egg ccg ggc 768 
Arg Asn Gin He Pro Thr Val Phe Tyr Gly Ala His Val Arg Pro Gly 
245 250 255 

act tac aac cag acc ate age cac tac aac gtg ett tec acg ctg gag 816 
Thr Tyr Asn Gin Thr He Ser His Tyr Asn Val Leu Ser Thr Leu Glu 
260 265 270 

cag atg tae ggc ttg ccc aag aeg ggt ttc gcg gcg aac gee ccg gtc 864 
Gin Met Tyr Gly Leu Pro Lys Thr Gly Phe Ala Ala Asn Ala Pro Val 
275 280 285 

ate get gat ate tgg ggc ggc taa 888 
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He Ala Asp He Trp Gly Gly 
290 295 



<210> 16 
<211> 295 
<212> PRT 

<213> Mycobacterium marinum 
<400> 16 

Met Cys Gly Leu Lys Gin Arg Phe Thr Ser Thr Phe Arg Ala Leu Ala 
15 10 15 

Val Leu Gly Ala Val Ala Val Ser Leu Pro Ala His Gly Ser Asp Ala 
20 25 30 

Pro Pro Arg He Asp Leu Thr Ala Thr Ala Leu Pro Ala Phe Ser His 
35 40 45 

Val Val Val Val Val Glu Glu Asn His Ser Gin Ala Asn He He Gly 
50 55 60 

Asn Lys Ala Ala Pro Phe He Asn Ala Leu Ala Ala Asn Gly Ala Met 
65 70 75 80 



Met Ser Gin Ser Phe Ala Glu Thr His Pro Ser Glu Pro Asn Tyr Leu 
85 90 . 95 



Ala Leu Phe Ala Gly Thr Thr Phe Gly Leu Lys Lys Asn Thr Cys Pro 
100 105 110 



Val Asn Ala Gly Ser Thr Pro Asn Leu Ala Ser Glu Leu Leu Ala Ala 
115 120 125 



Gly His Thr Phe Val Gly Phe Ala Glu Ser Leu Pro Glu Val Gly Ser 
130 135 140 



Thr Val Cys Ser Ala Gly Lys Tyr Gly Arg Lys His Ala Pro Trp Val 
145 150 155 160 



Asn Phe Ser Asn Val Pro Ala Thr Leu Ser Met Pro Phe Ser Ala Phe 
165 170 175 



Pro Thr Pro Ala Asp Tyr Ala Arg Leu Pro Thr Val Ser Phe Val He 
180 185 190 



Pro Asn Gly Asp Asn Asn Met His Asp Gly Thr He Ala Ala Ala Asp 
195 200 205 
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Glu Trp Leu Asn Arg Gin Leu Ser Pro Tyr Ala Asn Trp Rla. Arg ser 
210 215 220 

Asn Asn Ser Leu Leu He Val Thr Trp Asp Glu Asp Asp Gly Gly Ser 
225 230 235 240 

Arg Asn Gin He Pro Thr Val Phe Tyr Gly Ala His Val Arg Pro Gly 
245 250 255 

Thr Tyr Asn Gin Thr He Ser His Tyr Asn Val Leu Ser Thr Leu Glu 
260 265 270 

Gin Met Tyr Gly Leu Pro Lys Thr Gly Phe Ala Ala Asn Ala Pro Val 
275 280 285 

He Ala Asp He Trp Gly Gly 
290 295 

<210> 17 

<211> 1291 

<212> DNA 

<213> Penici Ilium chrysogenum 



<220> 

<221> CDS 

<222> (1)..(213) 

<220> 

<221> intro 

<222> (217.) ....(244) . 

<220> 

<221> CDS 

<222> (245) . . (1288) 



<400> 17 

atg etc acc aaa caa acc ctt etc gcg ttc gtc ggg gcc etc gee etc 48 
Met Leu Thr Lys Gin Thr Leu Leu Ala Phe Val Gly Ala Leu Ala Leu 
1 5 10 . 15 

gcc acg ggt aca act acc act gaa gag acc cca act cag get gag att 96 
Ala Thr Gly Thr Thr Thr Thr Glu Glu Thr Pro Thr Gin Ala Glu He 
20 25 30 

gat gea gea cgt get acg gee ctg ect tac tet cct gtg tea aac gta 144 



15 



wo 2004/033677 PCT/CA2003/001554 

Asp Ala Ala Arg Ala Thr Ala Leu Pro Tyr Ser Pro Val Ser Asn Val 
35 40 45 

aag ggt ttg gcc ttt gat cgt ttc gtg aac ate tgg etc gag aac aca 192 
Xiys Gly Leu Ala Phe Asp Arg Phe Val Asn lie Trp Leu Glu Asn Thr 
SO 55 60 

gta ggt ttc ccg ttg aat ata taacaatgac cacgcgctca cccctttgta g 244 
Val Gly Phe Pro Leu Asn lie 
65 70 

gac ttt gaa ccc get get tta gac gag aac. ctg tec aag etg gee aag 292 
Asp Phe Glu Pro Ala Ala Leu Asp Glu Asn Leu Ser Lys Leu Ala Lys 
75 80 85 

gag ggt ate etc ctg acc aac tac ttt gcc ate tct cac ccc teg cag 340 
Glu Gly lie Leu Leu Thr Asn Tyr Phe Ala He Ser His Pro Ser Gin 
90 95 100 

ccc aac tac t^t get tec gcc ggg ggt gac aca ttc ggc atg gat aat 388 
Pro Asn Tyr Cys Ala Ser Ala Gly Gly Asp Thr Phe Gly Met Asp Asn 
105 110 115 

gac gae ttc eta eaa ate ect teg aat gtc tea act att gcc gat etc 436 
Asp Asp Phe Leu Gin He Pro Ser Asn Val Ser Thr He Ala Asp Leu 
120 125 130 135 

ttt gat act aag eac ate tet tgg ggt gaa tac eaa gaa gae atg eec 484 
Phe Asp Thr Lys His He Ser Trp Gly Glu Tyr Gin Glu Asp Met Pro 
14.0 145 150 

tat get ggc tac eaa ggc aaa egg tat ccc etc age ggt ccg aac cag 532 
Tyr Ala Gly Tyr Gin Gly Lys Arg Tyr Pro Leu Ser Gly Pro Asn Gin 
155 160 165 

tac gtg cgc aag eac aac ccg ctg gtt ttg ttt aac teg gtt acc gae 580 
Tyr Val Arg Lys His Asn Pro Leu Val Leu Phe Asn Ser Val Thr Asp 
170 175 180 

gae gee gtg cgc eeg cgc eaa ate aag aat ttc ace act ttc tac gae 628 
Asp Ala Val Arg Pro Arg Gin He Lys Asn Phe Thr Thr Phe Tyr Asp 
185 190 195 

gat ctg aag eac cae age ctt eec eaa eac atg ttc ate aca eeg aac 676 
Asp Leu Lys His His Ser Leu Pro Gin His Met Phe He Thr Pro Asn 
200 205 210 215 

atg acc aat gac gcc cac gac acg aac ate act gtg gcc ggt aac tgg 724 
Met Thr Asn Asp Ala His Asp Thr Asn He Thr Val Ala Gly Asn Trp 
220 225 230 

gtc gat cgc ttc ctg tct ect eta ctg aag aac gag tac ttc acc aag 772 
Val Asp Arg Phe Leu Ser Pro Leu Leu Lys Asn Glu Tyr Phe Thr Lys 
235 240 245 

gac age eta gtg eta etc acc ttt gac gag gga gac acc tac tec tac 820 
Asp Ser Leu Val Leu Leu Thr Phe Asp Glu Gly Asp Thr Tyr Ser Tyr 
250 255 260 

CCC aac egg gtc ttc age ttc ctt gtt gga ggt get ate cea gag cac 868 
Pro Asn Arg Val Phe Ser Phe Leu Val Gly Gly Ala He Pro Glu His 
265 270 275 

ctg aag ggg acc act gac gac act ttc tac acc cac tac tea att gtc 916 
Leu Lys Gly Thr Thr Asp Asp Thr Phe Tyr Thr His Tyr Ser He Val 
280 285 290 295 

get tec ctg tct get aac tgg ggt tta ccc teg ctt ggt cgc tgg gat 964 
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Ala Ser Leu Ser Ala Asn Trp Gly Leu Pro Ser Leu Gly Arg Trp Asp 
300 305 310 

tgt ggc gcc aac ctg ctg aag atg gtc get gac aag acc ggc tat gtc 1012 
Cys Gly Ala Asn Leu Leu Lys Met Val Ala Asp Lys Thr Gly Tyr Val 
315 320 325 

aac tgg gaa gtt gat acc age aat gtc tac etc aac gag act tac cet 1060 
Asn Trp Glu Val Asp Thr Ser Asn Val Tyr Leu Asn Glu Thr Tyr Pro 
330 335 340 

gga cct atg tct acc gac aac tat tec tet aag tgg gcc gtt cet gee 1108 
Gly Pro Met Ser Thr Asp Asn Tyr Ser Ser Lys Trp Ala Val Pro Ala 
345 350 355 

acc aag ggc aaa tgc tct get ggc cat ggc att get gag gtc gtg aag 1156 
Thr Lys Gly Lys Cys Ser Ala Gly His Gly lie Ala Glu Val Val Lys 
360 365 370 375 

aat acc tac cac ggg ctt caa ccc acc tac gac tat gcc age cet gta 1204 
Asn Thr Tyr His Gly Leu Gin Pro Thr Tyr Asp Tyr Ala Ser Pro Val 
380 385 390 

ceg tat gac gtg ace agt gga aac aac gtc ggc ate aag tac cac ege 1252 
Pro Tyr Asp Val Thr Ser Gly Asn Asn Val Gly lie Lys Tyr His Arg 
395 400 405 ' 

act ctg gta tgt ate ctt tea tgt tct tec ctt tea tga 1291 
Thr Leu Val Cys lie Leu Ser Cys Ser Ser Leu Ser 
410 415 



18 
71 
PRT 

Penicillium chrysogenum 



<210> 
<211> 
<212> 
<213> 



<400> 18 

Met Leu Thr Lys Gin Thr Leu Leu Ala Phe Val Gly Ala Leu Ala Leu 
15 10 15 



Ala Thr Gly Thr Thr Thr Thr Glu Glu Thr Pro Thr Gin Ala Glu He 
20 25 30 



Asp Ala Ala Arg Ala Thr Ala Leu Pro Tyr Ser Pro Val Ser Asn Val 
35 40 45 



Lys Gly Leu Ala Phe Asp Arg Phe Val Asn He Trp Leu Glu Asn Thr 
50 55 60 



Val Gly Phe Pro Leu Asn He 
65 70 



<210> 19 
<211> 348 
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<212> PRT 

<213>. Penicillixun chrysogenum 
<400> 19 

Asp Phe Glu Pro Ala Ala Leu Asp Glu Asn Leu Ser Lys Leu Ala Lys 
15 10 15 

Glu Gly lie I*eu Leu Thr Asn Tyr Phe Ala lie Ser His Pro Ser Gin 
20 25 30 

Pro Asn Tyr Cys Ala Ser Ala Gly Gly Asp Thr Phe Gly Met Asp Asn 
35 40 45 

Asp Asp Phe Leu Gin lie Pro Ser Asn Val Ser Thr lie Ala Asp Leu 
50 55 60 

Phe Asp Thr Lys His lie Ser Trp Gly Glu Tyr Gin Glu Asp Met Pro 
65 70 75 80 

Tyr Ala Gly Tyr Gin Gly Lys Arg Tyr Pro Leu Ser Gly Pro Asn Gin 
85 90 95 

Tyr Val Arg Lys His Asn Pro Leu Val Leu Phe Asn Ser Val Thr Asp 
100 105 110 

Asp Ala Val Arg Pro Arg Gin lie Lys Asn Phe Thr Thr Phe Tyr Asp 
115 120 125 

Asp Leu Lys His His Ser Leu Pro Gin His Met Phe He Thr Pro Asn 
130 135 140 

Met Thr Asn Asp Ala His Asp Thr Asn He Thr Val Ala Gly Asn Trp 
145 150 155 160 

Val Asp Arg Phe Leu Ser Pro Leu Leu Lys Asn Glu Tyr Phe Thr Lys 
165 170 175 

Asp Ser Leu Val Leu Leu Thr Phe Asp Glu Gly Asp Thr Tyr Ser Tyr 
180 185 190 

Pro Asn Arg Val Phe Ser Phe Leu Val Gly Gly Ala He Pro Glu His 
195 200 205 

Leu Lys Gly Thr Thr Asp Asp Thr Phe Tyr Thr His Tyr Ser lie Val 
210 215 220 

Ala Ser Leu Ser Ala Asn Trp Gly Leu Pro Ser Leu <^ly Arg Trp Asp 
225 230 235 240 
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Cys Gly Ala Asn Leu Leu Lys Met Val Ala Asp Lys Thr Gly Tyr Val 
245 250 255 

Asn Trp Glu Val Asp Thr Ser Asn Val Tyr Leu Asn Glu Thr Tyr Pro 
260 265 270 

Gly Pro Met Ser Thr Asp Asn Tyr Ser Ser Lys Trp Ala Val Pro Ala 
275 280 285 

Thr Lys Gly Lys Cys Ser Ala Gly His Gly lie Ala Glu Val Val Lys 
290 295 300 

Asn Thr Tyr His Gly Leu Gin Pro Thr Tyr Asp Tyr Ala Ser Pro Val 
305 310 315 320 

Pro Tyr Asp Val Thr Ser Gly Asn Asn Val Gly lie Lys Tyr His Arg 
325 330 335 

Thr Leu Val Cys lie Leu Ser Cys Ser Ser Leu Ser 
340 345 

<210> 20 
<211> 1457 
<212> DNA 

<213> Aspergillus fumigatus 



<220> 

<221> CDS 

<222> {1)..(186) 

<220> 

<221> Intron 

<222> (187) . . (239) 

<220> 

<221> CDS 

<222> (240) . . (301) 

<220> 

<221> Intron 

<222> (302) , . (361) 
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<220> 

<221> CDS 

<222> (362) . . (1454) 



<400> 20 

atg aag cct tec gtc gcg act ttg ctt gcc act gtc tct ctg gtc tat 48 
Met Lys Pro Ser Val Ala Thr Leu Leu Ala Thr Val Ser Leu Val Tyr 
1 5 10 15 

get cag act get act gag aag gag cct teg ctg tct gcg ata gaa tct 96 
Ala Gin Thr Ala Thr Glu Lys Glu Pro Ser Leu Ser Ala lie Glu Ser 
20 25 30 



gca gea gcc tec ate cag cct tac tct ecc gtt teg aac gtt gag ggt 144 
Ala Ala Ala Ser He Gin Pro Tyr Ser Pro Val Ser Asn Val Glu Gly 
35 40 45 

gtt gca ttt aat cge ttc ttc caa gtg tgg ctt gag aat att 186 
Val Ala Phe Asn Arg Phe Phe Gin Val Trp Leu Glu Asn He 
50 55 60 

gtatgtgatc accttaccaa tcagagaatc ttgttcaaag ctgacctcag aag gat 242 

Asp 



tac gag gat get gcg gcg gat gag aac atg aaa tgg ctg gee teg caa 290 
Tyr Glu Asp Ala Ala Ala Asp Glu Asn Met Lys Trp Leu Ala Ser Gin 
65 70 75 

ggg ate ctg et gtaagaeetc atateggcca tetgeteaeg atttgaaett 341 

Gly He Leu Leu 

80 

cgaactaatt tactccacag c ace aat ttc tat gca gtc aeg cat cct tea 392 

Thr Asn Phe Tyr Ala Val Thr His Pro Ser 
85 90 

gag oca aac tac tgc get get gtt gga ggc gae aea ttt ggc atg gac 440 
Glu Pro Asn Tyr Cys Ala Ala Val Gly Gly Asp Thr Phe Gly Met Asp 
95 100 105 

aat gac aac ttt aac cag att cct gee aat gtt tct act gtc get gat 488 
Asn Asp Asn Phe Asn Gin He Pro Ala Asn Val Ser Thr Val Ala Asp 
110 115 120 125 

etc ctg gac ace aaa aac att get tgg gga gag tat cag gag eac tta 536 
Leu Leu Asp Thr Lys Asn He Ala Trp Gly Glu Tyr Gin Glu His Leu 
130 135 140 

cct tat cec gga ttc caa ggt ttc aac tat tec aac cag gag act tat 584 
Pro Tyr Pro Gly Phe Gin Gly Phe Asn Tyr Ser Asn Gin Glu Thr Tyr 
145 150 155 

gtc aat gac tat gtg egc aag cat aac cca ctg gtc ttg tat gac tct 632 
Val Asn Asp Tyr Val Arg Lys His Asn Pro Leu Val Leu Tyr Asp Ser 
160 165 170 

gtc ace aag aac age act cgt ttg cge cag ate aag aac ttt ace age 680 
Val Thr Lys Asn Ser Thr Arg Ijeu Arg Gin He Lys Asn Phe Thr Ser 
175 180 185 

ttc gag gac gae ctg gee aac aag aaa ctt cct caa tgg gea ttt ate 728 
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Phe Glu Asp Asp Leu Ala Asn Lys Lys Leu Pro Gin Trp Ala Phe lie 
190 195 200 205 

act cca aac atg acc aac gac get cat gac acc aac att act ttc gga 776 
Thr Pro Asn Met Thr Asn Asp Ala His Asp Thr Asn lie Thr Phe Gly 
210 215 220 

gcc aaa tgg gag cga age tgg att gcg ccc ttg etc aac aac tea tac 824 
Ala Lys Trp Glu Arg Ser Trp lie Ala Pro Leu Leu Asn Asn Ser Tyr 
225 230 235 

ttc atg aat gat acc eta ate eta ett acc ttt gat gag gat ggc act 872 
Phe Met Asn Asp Thr Leu He Leu Leu Thr Phe Asp Glu Asp Gly Thr 
240 245 250 

tat tec aag age aac aag ate ttc agt gtt ett etc ggt ggt gcc att 920 
Tyr Ser Lys Ser Asn Lys He Phe Ser Val Leu Leu Gly Gly Ala He 
255 260 265 

ccc gat gag ctg aag ggt act cag gac gat acg ttc tat acc cae tac 968 
Pro Asp Glu Leu Lys Gly Thr Gin Asp Asp Thr Phe Tyr Thr His Tyr 
270 275 280 285 

tea gtg att gcg tec gtg tec gcg aac tgg ggc ett cet teg ttg gga 1016 
Ser Val He Ala Ser Val Ser Ala Asn Trp Gly Leu Pro Ser Leu Gly 
290 295 300 

agg tgg gat tgt ggt gcg aac att ett gag att gtg gca aac aag acg 1064 
Arg Trp Asp Cys Gly Ala Asn He Leu Glu He Val Ala Asn Lys Thr 
305 . 310 315 

gga tat gtc aac tac gac gtt gac aca acc aat etc cgc etc aac gag 1112 
Gly Tyr Val Asn Tyr Asp Val Asp Thr Thr Asn Leu Arg Leu Asn Glu 
320 325 330 

acc tac ccc ggt ccc atg tea gcg ggc gaa tac teg aaa tac tec cet 1160 
Thr Tyr Pro Gly Pro Met Ser Ala Gly Glu Tyr Ser Lys Tyr Ser Pro 
335 340 345 

gtc tgg ecg aat gcc ttg ace egt ggt gac tgc tet get ggc cat ggc 1208 
Val Trp Pro Asn Ala Leu Thr Arg Gly Asp Cys Ser Ala Gly His Gly 
350 355 360 365 

att ttg gac att gtc aag gag ace tac gee aac acg gag cca aca tac 1256 
He Leu Asp He Val Lys Glu Thr Tyr Ala Asn Thr Glu Pro Thr Tyr 
370 375 380 

aac tat teg age ccc ttc cca tat gac act gcg age aac tac aac acc 1304 
Asn Tyr Ser Ser Pro Phe Pro Tyr Asp Thr Ala Ser Asn Tyr Asn Thr 
385 390 395 

aag gtg act gcc acc aaa aag aat gtc acc ggt aca cat aga agt tet 1352 
Lys Val Thr Ala Thr Lys Lys Asn Val Thr Gly Thr His Arg Ser Ser 
400 405 410 

tet tee tec tet ecg tea get age tec aac gcc get gtt tet get gtc 1400 
Ser Ser Ser Ser Pro Ser Ala Ser Ser Asn Ala Ala Val Ser Ala Val 
415 420 425 

get cet gca gee ggt gtc tet ggt etc etc ttg gga etc get eta aac 1448 ' 
Ala Pro Ala Ala Gly Val Ser Gly Leu Leu Leu Gly Leu Ala Leu Asn 
430 435 440 445 

ctg ett taa 1457 
Leu Leu 
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<210> 21 
<211> 447 
<212> PRT 

<213> Aspergillus fumigatus 
<400> 21 

Met Iiys Pro Ser Val Ala Thr Leu Leu Ala Thr Val Ser Leu Val Tyr 
15 10 15 

Ala Gin Thr Ala Thr Glu Lys Glu Pro Ser Leu Ser Ala lie Glu Ser 
20 25 30 

Ala Ala Ala Ser He Gin Pro Tyr Ser Pro Val Ser Asn Val Glu Gly 
35 40 45 

Val Ala Phe Asn Arg Phe Phe Gin Val Trp Leu Glu Asn He Asp .Tyr 
50 55 60 

Glu Asp Ala Ala Ala Asp Glu Asn Met Lys Trp Leu Ala Ser Gin Gly 
65 70 75 80 

He Leu Leu Thr Asn Phe Tyr Ala Val Thr His Pro Ser Glu Pro Asn 
85 90 95 

Tyr Cys Ala Ala Val Gly Gly Asp Thr Phe Gly Met Asp Asn Asp Asn 
100 105 110 

Phe Asn Gin He Pro Ala Asn Val Ser Thr Val Ala Asp Leu Leu Asp 
115 120 125 

Thr Lys Asn He Ala Trp Gly Glu Tyr Gin Glu His Leu Pro Tyr Pro 
130 135 140 

Gly Phe Gin Gly Phe Asn Tyr Ser Asn Gin Glu Thr Tyr Val Asn Asp 
145 150 155 160 

Tyr Val Arg Lys His Asn Pro Leu Val Leu Tyr Asp Ser Val Thr Lys 
165 170 175 

Asn Ser Thr Arg Leu Arg Gin He Lys Asn Phe Thr Ser Phe Glu Asp 
180 185 190 

Asp Leu Ala Asn Lys Lys Leu Pro Gin Trp Ala Phe He Thr Pro Asn 
195 200 205 

Met Thr Asn Asp Ala His Asp Thr Asn He Thr Phe Gly Ala Lys Trp 
210 215 220 
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Glu Arg Ser Trp lie Ala Pro Leu Leu Asn Asn Ser Tyr Phe Met Asn 
225 230 235 240 

Asp Thr Leu lie Leu Leu Thr Phe Asp Glu Asp Gly Thr Tyr Ser Lys 
245 250 255 

Ser Asn Lys lie Phe Ser Val Leu Leu Gly Gly Ala lie Pro Asp Glu 
260 265 270 

Leu Lys Gly Thr Gin Asp Asp Thr Phe Tyr Thr His Tyr Ser Val lie 
275 280 285 

Ala Ser Val Ser Ala Asn Trp Gly Leu Pro Ser Leu Gly Arg Trp Asp 
290 295 300 

Cys Gly Ala Asn lie Leu Glu lie Val Ala Asn Lys Thr Gly Tyr Val 
305 310 315 320 

Asn Tyr Asp Val Asp Thr Thr Asn Leu Arg Leu Asn Glu Thr Tyr Pro 
325 330 335 

Gly Pro Met Ser Ala Gly Glu Tyr Ser Lys Tyr Ser Pro Val Trp Pro 
340 345 350 

Asn Ala Leu Thr Arg Gly Asp Cys Ser Ala Gly His Gly lie Leu Asp 
355 360 365 

lie Val Lys Glu Thr Tyr Ala Asn Thr Glu Pro Thr Tyr Asn Tyr Ser 
370 375 380 

Ser Pro Phe Pro Tyr Asp Thr Ala Ser Asn Tyr Asn Thr Lys Val Thr 
385 390 395 400 

Ala Thr Lys Lys Asn Val Thr Gly Thr His Arg Ser Ser Ser Ser Ser 
405 410 415 

Ser Pro Ser Ala Ser Ser Asn Ala Ala Val Ser Ala Val Ala Pro Ala 
420 425 430 

Ala Gly Val Ser Gly Leu Leu Leu Gly Leu Ala Leu Asn Leu Leu 
. 435 440 445 

<210> 22 

<211> 8 

<212> PRT 

<213> Mycobacteritam tuberculosis 



<400> 22 
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Asn Asp Met His Asp Gly Ser lie 
1 5 
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